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ON THE EFFICIENCY OF DIFFERENT SFA®CH STRATEGIES 

C, Ktshler and r, Watner 
Carman Cancar Research Center, Heidelberg 
Inatituta for Documentation, Information and Statistics 



v . SYNOPSIS 

Five wall known search strategies in EDP have bean 
compared with a newly developed technique. The latter 
proved more efficient, both with retard to the mean 
number of access points and the probability of a hit 
within the first five attempts. 
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For the task of information retrieval in computer-based 
systems different search strategics have been developed, 
the efficiency of which is to be compared (cf. Table 1 ) » 
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Effioienoy of Search Strategies 



1. ) Sequential Starch 

In searching sequentially, a file of tl elements is checked 
with a key-word record per record until the element searched 
for has been found. Previous sorting of the material is not 
required. The mininun number of comparisons eouala l, 4 the 
maximum v __„ * ?»', therefore the mean number v _ a ?l ■ - 

TTAX n 7 • 

If M elements are searched for simultaneously, the mean 
number of attempts per element will he v n s ^ 

2, ) Binary Search 

The precondition for using the binary search method is strict 
order and a knowledge of the number of elements in the file. 
At every step the part of the file which is to be searched is 
subdivided into 3 parts, namely 

1, the element addressed for the purpose of comparison 

2, any "smaller" elements 

3, any "larger" elements. 

The element addressed is compared with the concept to be 
searched for ("eoual", "smaller", "larger"). According to 
the result of this comparison, the search is either completed 
(elesMnt has been found) or it is to be continued within 
the "smaller" or "larger" elements only. The search strategy 
is optisial if the remaining quantity of elements is halved 
at each step. In this case, the ranges of the jump form a 
monotonously decreasing senuence with the factor 1/2, 

For the N elements of a file N » 2* - it this leads to the 
mean number of access points 



3.) Block Search 

Using the block search recuires the subdivision of a sorted 
file into -blocks of length D, The first comparison takes place 
with the last element of the first block, Tf the key-word 
to be searched for is "larger", the programme jumps to the 
last element of the second block where renewed comparison 
takes place. The procedure is continued until the addressed 
element is "equal" to or "smaller" than the key-word. If it 
is "equal", the desired element is found! otherwise, the 
addressed block is looked up seouentially until the desired 
element haB been found. 



v 



m 
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V Evaluation of Retrieval Effeotiveneee 



In a file of H elements with block lenrth « D there are 
|f/D blocks. In order to address all these seouentially, 
a maximum, of U/D steps is necessary. Within a block a 
maximum of D-l steps is renuired to find a certain element 
This leads to a mean number of steps 



v 

m 



I? ♦ D 






4,) Chaining Methods 



There are two different chaining methods, the single chain 
and the double chain. In the single chain method the address 
of the subseouent element is stored in addition to each 
element. The entire file is searched element by element, 
usinc the chained address, until the element searched for 
has been found. The single chain method corresponds to the 
sequential search in principle; the mean number of search 
steps reouired is the same, namely 

v n a w ; 1 

The double chain method is only possible in strictly 
hierarchical structures. In this method 2 addresses are 
stored simultaneously with each element - namely the one 
of a neighbouring element on the same level and the one 
of a neighbouring element on the next lower level. 

The mean length of search is 

Ym » 1.4 lot}** 

5.) Distributed Key-Search 

The distributed key search requires a strictly 
hierarchical order of the elements in the file. Each 
level is characterized by a number; each element of 
the file is unequivocally determined by the combination 
of all numbers. On the basis of the complete combination 
of the numbers any desired element can be retrieved. 

With a symmetrical order of the material and a mean number 
of branches of about 4 there is an optimal mean number of 
access points of 

Y n « 1.24 lott 7 N 

However, this optimal value is rarely reached, since the data 
collections are hardly ever optimally structured. 
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(512) BLASTOGENESE 

TRANSFORMATION BLASTIQUE 
BLASTOGENESIS « 

BLEI 

PLOMB 

LEAD 

BLUT 

SANG 

BLOOD 

BLUTDRUCK 

PRESSION SANGUINE 
BLOOD PRESSURE 

(516) BLUTGEFASSE 

VAISSEAUX SANGUINS 
BLOOD VESSELS 

BLUTGRUPPE 

CROUPE SANGUIN 
BLOOD GROUP 

BLUTSERUM 

SERUM SANGUIN 
BLOOD SERUM 

BLUTVOLUMEN 

VOLUME SANGUIN ~ 
BLOOD VOLUME 



(524) BOR 

' > BORE > ' 

BORON 

Fir*!: Part of the Threelin*ual Thesaurus 
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V Evaluation of Retrieval Effeotiveneee 



None o f the above named nearch strategies appeared to us to be 
ontinally suited to the practical task of translating key-words 
in a trilingual thesaurus (F jgsl) in alphabetical order into 
another language. V f e, therefore.” developed a search algorithm of 
our ov:r - the so-called serial mnp-search . The application of 
this method presupposes sett i ng up a Correspondence table 
containing the addresses within groups of the first 2 letters 
(cf. Table 2). 
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This method is to bs briefly explained by the following examples 
The first step towards ths solution of the task consists in 
comparing the 2 first letters of the word to be translated with 
the correspondence table) thereby the beginning address for the 
search is established • 

The word "BlutgefAsse" (blood vessels) is to be translated from 
Cerman into French. The first two letters of the word "Blutgefasse" 
(BL) lie between BE and CA. According to the correspondence table, 
the search interval ranges from address 452 to address 595 (Tig. 2? . 
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Efficiency of Search Strategiee 




?! 



Description of the Serial Jump Strategy 

't* m ~ ’• 
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V Evaluation of Retrieval Effeotiveneee 



The number of elements to be compared from the start to the end of 
the search interval is called D. Tn our example D actuals 144, D is 
subdivided into X segments (in our example X = 12). The first 
element of each segment (beginning with the second segment) is 
compared as to whether the word to be searched for is equal to, 
lareer than or smaller than the one addressed. If the word to be 
searched for is enual, the search is completed. If it is larger 
than the word found, then the programme jumps to the first address 
of the next segment (in our case = 476) and this address is 
compared. The word "Bein" (lee) found at this address is smaller 
than the search word "Blutgef3sse"f therefore, the search is 
continued at address 480 and so on. If the word to be searched for 
is smaller than the word found (e.g. "BlutgefSsse* is smaller than 
"Bor" TBoronl ) , then the search is continued serially from the 
second ‘word of the preceding segment until the word has been found 
within this segment*. 

The number of access points needed to find a certain word is 
composed of the number of jumps and the number of steps within a 
segment. (In our example 10 search steps were reouired, namely 6 
segment jumps and 4 steps in the last segment.) 

The general rule is: 

minimum number of access points Tnin * 1 

maximum number of accwss points * (x«l)*(jf *1) * *♦§ ■ 2 



Dependent on the size of the search interval D, the value of X can 
be optimized by partial differentiation of the equation (1) 

(M «\P). 

When inserting the values D * 144, X « 12 in equation (1) we 



In using D * 81 (X * 9) it follows that Y ffi « 6.S, 

With a total of 5.000 words in the thesaurus this value is 
already less than the corresponding value of the binary search. 

Prom equation (1) it follows under optimal conditions that 



With N * total number of elements in the file and S * number of 
intervals defined by the correspondence table 



mean number of access points * 




obtain Y_ * 
in 



144 



♦ 



11 



* 11.5. 




( 2 ) 
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D * § (3) 

Fro* (2) and (3) it follows that - 1/2 

This relation has been calculated for different values of I! and S 
and is shown by a family of curves (Fig. 3 ) . Mean numbers of access 
points ^ for the binary search are incIuHeH for the purpose of 
comparison. Fig. 3 shows that our search strategy is better for 
a given value of H, if the number R of the address fields in the 
correspondence table is large enough. 




* Y«i 



Fis. 3 : The mean number of access points in serial jump 
strategy depending upon the size of the material 
to be searched and the number of address fields 
in the correspondence table compared with the 
corresponding values for the binary search. 
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Binary Search 
No. of address 
No. of address 
No. of address 
No. of address 
No. of address 
No. of address 
No. of address 



fields » 10 
fields » 20 
fields * 25 
fields * 35 
fields * 50 
fields * 70 
fields « 100 



N ■ number of the elements of a file 
Y 3 mean number of access points 




EVALUATION OP RETRIEVAL EFFECTIVENESS 

T.M. Ait chi a on and Angela M. Hall 
INSPEC, Institution of Electrical Engineers 9 London 



SYNOPSIS 

A comparative evaluation is described of a number of 
controlled and 'natural* languages which were candidates 
for use for retrieval from the INSPEC data base and for 
* SOI. The problems involved in such an evaluation and the 
general conclusions reached are discussed. 

The comparative evaluation of index languages arose from the need to 
decide on the main subject organising and retrieval tool to be used 
in INSPEC 9 a integrated computer-based system, one aspect of vhich is 
described by Simmons 9 Vernon and Martin in a paper in Area 4. Since 
the basis of the system is that each item included in the data base - 
at least 125 » 000 in 1971 - would be input only once, it was important 
to try and ensure that the index language used should be both the 
most suitable one available and equally useful for the printed 
indexes to the abstracts Journals produced by INSPEC, for a selective 
dissemination of information (SDl) system, and for retrieval .from the 
data base • 

It Is probably true to say that the choice was not completely open 
since a classification system (whether U.D.C., a true hierarchical 
one or a faceted classification) was never seriously considered and 
there was an implied assumption that whatever language was chosen 
should be employed in a coordinate- indexing mode. 

Like most information workers we had had a fundamental conviction 
that a controlled language must be superior to an uncontrolled or 
'natural 9 language v and that the performance of an information 
retrieval system was mainly dependent on the degree of intellectual 
effort involved in the setting up of the index language and in the 
indexing. However, the results of the second Cranfield tests (l) 
suggested that these convictions or innate assumptions might not be 
entirely valid, and that, at the very least, the whole question 
should be considered open. 
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It was on that basis that the possibility of conducting a comparative 
evaluation first presented itself. Although it was desired to select 
a language equally suitable for printed indexes, SDI and retrospective 
searching of a machine file, only for retrospective searching had a 
methodology been developed (by Cleverdon, Mills and Keen for the 
Cranfield (l) tests). This, although it had its share of criticism, 
seemed sensible, understandable and had the decided advantage that its 
critics had not been able to suggest an alternative. It was hoped to 
avoid at least some of the difficulties and forestall some of the 
criticism in the design of the new evaluation. In particular it was 
intended to take advantage of the INSPEC facilities by having soft* 
ware developed to allow the evaluation data to be obtained by program 
and by using, for the evaluation, documents already indexed for the 
normal production of INSPEC services* 

The index or retrieval languages which it was decided should be 
studied t since they were already available or potentially available 
in the INSPEC database, comprised t 

1. , Title 

2. Abstract and title 

3. Hybrids the subject heading and plain- language modifier 

line used in the printed indexes to the abstracts 
journal (i.e. a combination of controlled and 
uncontrolled language). 

4. Free-indexs free- language terms applied by an indexer 

(applied initially, in the SDI Investigation 
before translation into the controlled 
language ) • 

5* Controlled! a thesaurus-based controlled language used 
in the INSPEC SDI Investigation. 

It is obvious that the main problem in comparing these five options 
lies in ensuring that the controlled language is compared fairly 
with the other four languages in their free-language formulations. 

Ve hoped to overcome the problem by using a vide variety of free- 
languago formulations and as many controlled- language formulations 
as possible. 

Methodology 

the design of the evaluation (2) vaa based on Batching a test 
collection of documents with a test collection of questions to which 
each of the documents had been assessed for relevance. The documents 
in the collection required to be indexed by each of the languages 
to be evaluated, and the questions formulated in the same language. 
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Retrieval Effeativeneee 



The aain difficulties that may be expected in setting up such an 
arrangement are that the document indexing for the test may be 
consciously or unconsciously biassed in favour of one of the 
languages | the questions or their formulation may favour one 
language | or it may be difficult to persuade the questioners to 
assess a large enough test collection of documents for relevance 
to their questions* 

In the INSPEC evaluation it is considered that all of these difficul- 
ties were overcome v some more successfully than others* 

In the indexing of documents the possibility of bias was entirely 
ruled out by selecting documents which had been routinely indexed 
before the test was postulated. The indexing had been carried out by 
different groups (i*e* Hybrid by one group and the two SDI languages 
by another) but it was considered that this was not a disadvantage 
since for INSPEC the main interest was in comparing the total system 9 
i*e* both the language and its use* 

The possibility of the choice of questions influencing the comparison 
was accepted rather than avoided by using 'real' questions submitted 
by a vide variety of electronics research workers 9 while the 
formulation difficulty was reduced, if not overcome, by using a total 
of 30 different formulations for each question, i.e* 10 question 
formulations, each in three different logical product coordinations 
or strategies* 

The different question formulations used are shovn k in Figure 1* The 
three strategies required the coordination (by 'AND') of the minimum, 
a moderate, or the maximum number of tenns in the question and 
corresponded to low, medium and high exhaustivity of search* 

An example of the A and C formulations for a question is given in 
Figure 2* 

In the assessment of documents for relevance to the questions, a 
method was found by which the questioner, with comparatively little 
effort was able to assess some 1200 documents* This depended on his 
having already assessed the documents for relevance to his profile in 
an SDI service and his being supplied with the profile- relevant 
documents for assessment of relevance to his question* The method 
worked satisfactorily in the main, although some assessments had to 
be discarded when it was found that some questioners had again 
assessed the question documents for relevance to their profile. A 
more serious consideration arises from the fact that the assessments 
were highly subjective, peculiar to each questioner, instead of ob- 
jective and reproducible as would have been the case if a consensus 
Judgement had been obtained* However it was considered, that the 
subjective assessment represented the normal situation and should 
therefore be retained* 
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Quest ion formulations 

Single- 1 era free- language formulations 

A. The words of the question as stated by the 
questioner (and 9 where necessary, selected 
for the evaluation) • 

B* The words of 'A f with true synonym and other 
word form added as alternatives f and word 
endings confounded as appropriate. 

C. As *B' with the addition of quasi- synonyms and 

generic, ho re specific, or related terms as were 
considered likely to be advantageous* 

Simnle-concept f ree- language formulations 

In these formulations, sisple adjectivs-and-noun 

phrases, and a small number of others were preserved* 

E. As ’A' with simple concepts preserved* 

F* As 'B* n " " " 

G* As ’C* w ' n " " 

Hybrid (free-and con trolled- language) formulations 

D* As 'C* with appropriate Science Abstracts subject 

headings added as an additional set of search terms* 

H* As ‘D 1 but based on ‘G*. 

Controlled- language formulations 

R* The words in the question translated by strict use 
of the thesaurus into the controlled language used 
in the SDI Investigation* 

S. As •R* but with free use of alternative terms, similar 
to the free- language formulations •C 9 and 9 0 9 * 



Figure 1 



376 



•23 



Retrieval Effeativeneee 






Questions 


Info net ion on the crystalline 
of cadaduM sulphide? 


structure 


Field 


Fonaulation A 


Fonauletion C 


10 
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cryst* 

xtal* 

nicro structure* 


11 


structure 


structure 

orient* 






lattice* 

• 

nicrostructur* 

state 


12 
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cedMiuM 
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Cd 
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sulphide 


sulphide^ 
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DEVIL 



To carry out tho evaluation the DEVIL (Direct Evaluation of 
Index Languages) system was developed by INSPEC. By this systeu 9 
which is based on the INSPEC general-purpose file maintenance system 9 
the question and documont data were input 9 validated and corrected! 
searches for matches were carried out as required! and (since the 
relevance assessments wero included in the tost collection files) tho 
results woro roported and the performance measures calculated by 
program for each formulation of each question and for the equivalent 
groups • 

Matching tests 



In the tests a total of 97 questions were used in tho main set 9 and 
82 in a subset from which questions with particularly doubtful 
(probably erroneous) rolevanco assessments had been removed. Hie 
document collection comprised 542 documents each 'indexed' by the 
five languages. An example of the indexing of an input document is 
given in Figuro 3. 

Tho evaluation consisted essentially of matching each formulation of 
oach question against the appropriately-indexed document collection 
in two modos 9 for which tho terms 9 Plain Coordinate Matching (PCM) 
and Booloan matching (Booloan) woro coined. In tho P.C.M. mode 9 a 
match on all the 'n' terms in tho question was first roquirod 9 then 
of any n-l 9 then of any n-2 f etc. until single-term matches were 
reached. Such an output is shown in Figure 4. 

In the alternative modo 9 Booloan y a logical relationship was 
spocifiod for each question. Thus oach question might require a 
difforont number of terras to bo coordinated. An example of such an 
output is shown in Figuro 5- 

Three performanco raeasuros were calculated! 

Recall (rolating the numbor of relovant documents retrieved 
to tho total numbor of relevant in the collection) 

Precision (rolating tho numbor of relevant documents rotriovod 
to the total numbor of documents retrieved) 

Fallout (rolating tho numbor of non-relevant rotriovod to 
tho total non-rolovont in tho collection) 

For oach formulation of tho sot or subsot of questions 9 the rosults 
for the individual questions wore averaged by two methods and 
totalled by four difforont systems 9 since each of these had cortain 
advantages but none was froe from some drawback. 
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An example of the indexing of an input document 



Titles 


The ultrasonic interface micrometer 


Abstracts 


In nuclear reactors » thickness of the walls 
of tubes are measured by ultrasonic continuous- 
wave and pulse-wave micrometers. The principles 
and recording techniques of the instruments are 
described. The continuous -wave micrometer is 
used for simultaneous determination of thickness 
during defect- testing operations; the pulsed- 
wave instrument is preferred when better surface 
discrimination is needed. 


Hybrid s 


CBNEMDt Measurement by electrical methods 9 

distance, nuclear reactor thickness 9 
U.S. interferometry. 

CBMAs Measuring apparatus 

CBNPs Nuclear power 

CBUs Ultrasonics, tube thickness me as 
in nuclear reactor 


Free- index s 


Ultrasonic interference micrometer, thickness 
measurement DM, continuous wave micrometer, 
pulsed wave micrometer, design, recording, 
applications. 


Contz’olleds 


Ultrasonic waves, interference, dimensions, 
measurement, continuous waves, pulses, design, 
recording, applications. 



Figure 3 
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Plain coordinate watching output 

Questions C 030 Total documentss 542 

Document indexings Free- index Question descriptors s 4 

Relevant ‘documents! 2039 / 1 9 2273 / 1 » 3067/1 f 322l/l 



Documents 

2039/1 

3067/1 

1155/0 

1050/0 

1097/0 

1187/0 

2030/0 

2060/0 

2219/0 

2274/0 

3214/0 

4250/0 



4137/0 

2273/1 

1081/1 

1131/0 

2005/0 

2031/0 

2166/0 

2232/0 

3082/0 

3221/1 

4287/0 



1084/0 

1133/0 

2021/0 

2048/0 

2172/0 

2235/0 

3104/0 

4117/0 

4299/0 

Figure 4 



Tot Rel 

Ret Ret R 0 H Vf 

1 1 25.0 0.0 100.0 

3 2 50.0 0.2 66.7 

5 3 75.0 0.4 60.0 



32 4 100.0 5.2 12.5 



Boolean Output 



Question 


formulations C 




Total documents s 542 


Document 


indexings Free- index 
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C 083 
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1 


1 


50.0 
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0 
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0 


0.0 
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3003/1 


1 
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1 
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0.0 100.0 


C 086 


2011/1, 2210/0 
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1 


1 


100.0 


0.2 50.0 
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Results 



The matching of the 542 documents and 97 questions in 10 formulations 
by 3 strategies, and the use of two methods of averaging and four 
types of totalling produced a very large quantity of data (3). 

Examples of Recall/Precision plots of P.C.M. and Boolean outputs are 
shown in Figures 6 and 7 respectively. 

The data were supplemented by analyses of failures in recall and 
precision (i.e. why relevant documents were not retrieved and why 
non- relevant documents were retrieved). 

i 

The results showed that the controlled language was much superior 
to any of the others. Of the remainder the free- indexing was 
superior, followed by the combination of subject headings and 
modifier line, then title, and title-and-abstract • 

The title-and-abstract was shown to have the highest exhaustivity 
and to give the highest Recall of any of the languages, but it had 
easily the worst Precision. In other words it has sufficient terms 
to provide many of the required matches, but many other words which 
produced matches with non-relevant documents. 

Hie low exhaustivity of the title gave a consistently low Recall but 
conversely a very high Precision, superior to any of the other 
languages. Thus its use could be recommended for situations in which 
only a proportion of all the relevant documents, with as few as 
possible non-relevant, are required. 

The exhaustivity of the combination of the subject heading and 
modifier line of the abstracts journals was found to be only slightly 
better than that of the title while the Precision was slightly worse. 
Thus its overall performance was only a little superior to that of 
the title. 

Hie free- indexing was shown to have an exhaustivity only slightly 
worse than that of title-and-abstract and to have a similar maximum 
Recall, while its Precision was little worse than the subject-heading/ 
modifier combination. It was therefore superior to the other three 
uncontrolled languages and only outranked by the controlled language. 

Probably the most important outcome of the evaluation came from a 
comparison of the free-indexing and controlled language. Sinco the 
document indexing based on these languages contained the same 
concepts (the controlled- language indexing being the translation of 
the free-indexing into controlled terms), it followed that the 
superiority of the controlled language must be attributed to the 
question formulation. The failure analysis showed that tho super- 
iority arises from tho built-in ability of tho controlled language 
to include in the formulation all the alternative words and concopts 
which might be used to express the question. 
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This is a function of the thesaurus and in particular of the procedure 
of identifying new terms as they occur in the indexing of the input 
documents and adding them to the thesaurus in their correct relation- 
ship to the other terms. This suggests that a similar retrieval 
performance might be obtained with free- language indexing by the use 
of a f roe- language thesaurus for assistance in question formulation. 
Each new term as it occurred in the document indexing would be added 
and its relationship to the other thesaurus terms indicated. 

It is on these grounds that the decision was made to adopt free- 
indexing as the basic medium for information retrieval and SDI in the 
INSPEC system. At present y subject headings and modifier lines are 
still used in the printed indexes to the abstracts journals 9 but f as 
described in the paper by Field in Area 9 the subject headings are 
being related to the free-index terms in a thesaurus structure and 
this will be used as the basis of the free-language thesaurus. 

Although in setting up the evaluation project it was not intended 
to seek results that would necessarily be applicable to other than 
the INSPEC environment | it is believed that the main conclusions - 
the superiority of the controlled language and the expectation of 
obtaining a similar performance by the combination of free-indexing 
and a free-language thesaurus - are generally applicable. 



1. Cleverdon 9 C. t Mills t J. 9 . and Keen t M. ; Factors 
determining the performance of indexing systems 9 
vol.ll design; vol.2l test results; Cranfield; 
Aslib Cranfield Research Project 9 19 66. 



2. Aitchison* T.M. 9 and Tracy f Jennifer M. ; Comparative 
evaluation of index languages. Part II design. 
Report No.R«70/l. London 9 INSPEC 9 The Institution of 
Eloctrical Engineers 9 July 1969 • 



I). Aitchison v T.M. 9 Hall, Angela M. 9 Lavollo 9 Katherine H. 9 
and Tracy 9 Jennifer M. | Comparative evaluation of 
indox languages. Part III rosults. Report No.R.70/2. 
London 9 INSPEC 9 The Institution of Eloctrical Engineers 9 
July 1970. 
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A RESEARCH TECHNIQUE TO COMPARE THE RETRIEVAL 
EFFECTIVENESS OF ABSTRACTS AND INDEXING TERMS 

T. Bloch, U* Bloch and K*D* Ofer 



SYNOPSIS 

The method proposed to compare retrieval effectiveness of 
descriptors and abstracts, endeavours to minimize subjective 
decisions by duplicating the basic function of extrsctlng 
the terms from the abstract and by restricting the subject 
specialist to a few prescribed decisions. The comparison 
between the two surrogates Is based on comparing the number 
of terms In the various categories, but does not allow for 
weighing different terms as to their possible effectiveness 
In retrieval* 



The Problem 

When using tape services as the Input for a mechanized Information service, one cannot 
simply merge the different services together, not only becsuse their formatting Is not 
stsndardlzed, but also becsuse the document surrogates very from service to service, so 
that the search profiles must be prepared separately for esch* 

Some servlcca give (In addition to the citation, the author and the title), sn abstract 
and others a set of Indexing terms* Several tape services Include both surrogates. 

In this lsst case, It Is of Interest to know, whether one of them, the title together 
with the abstract, or the set of Indexing terms - would give as good a retrieval on 
Its own as together with the other* In other words, whether the set of terms available 
from tlie descriptors ( M D M ) and the set of terms from the abstract ("A") are completely 
overlapping, partly overlapping, or completely disjoint* It would also be of Interest 
to know, which set usually comprises more specific terms than the other • 

Once we know the answer to these questions, It should facilitate the design of a better 
search strstagy, e*g* If we know that the terms In the abstract sre more specific thsn 
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look first at the 
bother to search 
search, where the 
still be available 

for empty searches* Alternatively, looking only at "D" terms would make for faster 
throughput of a serial file. 

Naturally, these decisions will not be the same for all tape services, but aay change 
from one to the other depending on the quality of the abstract and of the indexing 
terms. This study shows a research design that could be employed routinely to decide 
upon the above problem applied to a specific case* 

Definitions 

1* Equal or synonymous terms are called “joint” terms* 

Expressions are “synonymous”, if we can assume, that an Information scientist 
would use either In the construction of the same profile* 

2* Where one term is more specific than the other, the two terms are called 
•’inclusive" terms. 

3. Any term which is neither joint nor inclusive Is a "unique" term, 

4. The set of all terms used as descriptors for a document la Its "set of descriptor 
terms" or "D". 

5. The set of all terms appearing In the list culled form the title, the bibliogra- 
phic citation and the abstract of a document is called its "set of abstract 
terms" or "A". 

The Method 

The decisions are based on comparisons between the number of terms in the two sets. 
The number of terms in "D" is usually not difficult to ascertain, but defining the 
set "A" is more laborious. One cannot simply count the words left over after elimi- 
nating all words from n copious "stop list", but one must ensure that only "semantic" 
words are chosen. 

The following technique is suggested: 

Abstracts are chosen randomly from the service for which the test is performed* 

These are photographed three times. From two batches indexing terms are cut away 
and the remaining title + citation + abstract are given to two intelligent persons, 
with a good command of English, but with no subject knowledge* They go over the 
abstracts and list all expressions found in them that are - in their opinion - of 
sufficient value to rate as indexing terms* 

Compatibility between the "indexers" 1 b developed as follows: At the first session 
the two meet with a subject specialist* After the general ideas of the proceedings 
are explained, each goes over an abstract on his own and draws up a list of his "set 
of abstract terms". The resulting two lists are compared, and where they are not 
identical the differences are discussed with a view to develop rules how to create 
"A"* Wherever possible, these rules are formalised (see Appendix A), In this way 
the indexing of about a dozen abstracts is practiced. 



the descriptors, we should design the mechanized search program to 
descriptors and only after a general term is found in "D" would we 
"A" for the more specific term* This could lead us to a two-stage 
"A" terms would make for higher precision but the "D"-search would 
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At the end of the session! each "indexer" takes home another batch of abstracts from 
which he prepares » on his own* a set of abstract terms! according to the rules. At 
the next session these two proposed sets are compared and differences are again dis- 
cussed with a view to adapt the rules. These sessions are repeated until the two 
lists - prepared at home - are reasonably close. 

At this stage another three batches are prepared containing I +AI different abstracts. 
Again two batches are given to the two participants! but this time no discussion takes 
place. Instead! the two sets of terms are compared by the subject specialist (who so 
far had no access to the set of descriptor terms "D”) • Where the two lists differ 
by more than one or two terms! the abstract is excluded from further analysis (£1 
items). Where the lists differ by only one or two terms! the subject specialist may 9 
at hla discretion! adjust the two lists to obtain "A'*! the final set of abstract 
terms (I items). 

Only now 9 after "A” has been extracted by the above procedure! does the subject 
specialist compare it to the set of descriptor terms "D". He decides to which of 
the three classes (joint 9 inclusive or unique) each term belongs! and which are the 
more specific ones. This method should provide maximum objectivity on the part of 
the subject specialist. 

In his decision-making process he has merely to compare two lists. He will look first 
for those terms that appear in both lists and will "make-up” the joint list. He will 
then have to use his discretion to decide which are synonymous expressions and which 
are inclusive ones. Finally he goes over the remainder of each list and checks 
whether these are Indeed terms for the "unique" list. 

This process was relaxed for only one special case. It happened a number of times! 
that the two "Indexers" who prepared the original lists of "A"; in their zeal to 
Include only "semantic terms” in their lists! both decided to exclude an expression 
that appeared in the abstract. But when "A" was compared to "D” it turned out that 
this very same expression (or an obvious synonym) was Indeed Included in "D". Where 
this happened 9 the subject specialist was permitted to add the term to "A". We 
decided thus because we want principally to compare mechanized retrieval efficiency! 
and this "lost term" in "A" would certainly have been found by a computer working in 
a free-text mode. 

The next step consists simply of counting! for each item! the number of joint terms 
(aj * dA * nj); of more specific and less specific inclusive terms 
(ams m o Is ; a £ s - d^g) and of unique terms & d u ) for both the set "A" and the 
set ”D"« Each of the above counts is sunned for all items! thus: 




(all joint terms) 



I 



n i " J ftms *** ^ms 
I 



(all inclusive terms) 



n, 



ua 



•[ 



a, 



u 



(all terms unique to the abstract) 



I 
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n ud * d u (all t 

I 



(all terms unique to the descriptors) 



n ua * n ud 



(all unique terns) 



N - nj + n i + n u (number of terms given to the batch of size I) 



We are now ready to try and answer the original questions: Ib there a complete over- 
lap between the two surrogates or does one give appreciably more (unique) terms than 
the other? Does one of them usually have the more specific terms? How large a re- 
call loss (fewer entry points) will be sustained if only one surrogate is used for 
retrieval purposes? 

Most decisions can be based on descriptive statistics, and no advanced statistical 
techniques are required. If, for instance, the number of more specific terms in the 
abstracts is appreciably greater than the number of more specific terms among the 



small, we know what fraction of the entry points are lost f should we decide to use 
only the descriptors. The decision whether to use one or both surrogates , will 
thus be based mainly on the expected recall loss. A test of significance will only 
be necessary to make sure that we discard the right surrogate, though in practice 
this decision will probably be influenced by factors outside this study. To present 
the data to the system designer, with whom the final decisions lie, it is thus suf- 
ficient to turn it into percentages. Confidence levels of the differences can be 
computed using a non parametric test, such as chi-square or - for very small samples 
Fisher’s exact probability test. 



The above procedure was used to test the surrogates of the U.S,G,R,D,R« (now G.R«A«) 
service. To check to what extend our findings apply to the various sections, we 
tested two widely different subjects, namely section 5 covering the behavioural 
sciences, and section 9 on electronics. That the abstracts of the two sections dif- 
fer markedly became clear already during the ’’compatibility phase". Whereas for 
the behavioural sciences quite a few sessions were required, electronics Just one 
session. As the electronics section was prepared by the same two participants only 
after they had acquired proficiency, this alone might be attributable to the 
learning process. But batches in behavioural sciences prepared later were still 
more discrepant than those prepared for electronics. In the final batches 

AI was 26% for the behavioural sciences and only 5.6% for electronics. 



descriptors ( ^ a ma > > 




Example 



AX + I 



This difference computed according to X^“ is significant with P < 0.01. 
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Table 1 : Comparison of number of terms in "A** & "D" In the 
behavioural sciences* 





No. 


% 


I 


37 




N 


624 


100 


n J 


139 


22.2 


n i 


65 


10.6 


n ua 


170 


27.2 


n ud 


250 


40 




54 


8.8 


j ^ms 


11 


1.8 



Table 2 : Comparison of number of terms in "A** & M D M in 
electronics* 





No. 


% 


I 


35 




N 


458 


100 


n J 


104 


22.7 


n i 


35 


7.7 


n ua 


194 


4 2.3 


n ud 


125 


27.4 


a ms 


25 


5.4 


^ms 


10 


2.2 
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The various counts for these final two batches are given in Table 1 (for behavioural 
sciences) and Table 2 (for electronics). Clearly , there is little overlap between 
the two surrogates, as fully 2/3 of the terms are unique. It la perhapa of Interest 
to point out that, while in one section one surrogate provides more unique terms, 
in the other the position is exactly reversed. Thus we cannot aay which surrogate 
may be dropped (should one really be prepared to forgo a quarter of the entry points). 

The more specific of the inclusive terms are always more numerous within the abstracts 
with a significance level of 0.01 < p < 0.02 for section 9 (electronics); and of 
p < 0«01 for both section 5 (behavioural sciences) and the two combined sections. It 
appears that the process dlscrlbed in the beginning, namely to use "D" aa a sieve for 
pinpointing possible hits and use the abstract for the final check, whether the item 
is relevant, may be useful (although the many unique terms in the abstract make this 
technique less attractive). 

Before taking any final decision we would crosa-valldate the results, by repeating 
the experiment - possibly on another two sections. 



Appendix A 

Examples of "Rules how to prepare set "A". 

1. List all words you deem relevant and all words you do not understand, 

2. If synonyms appear, list them. 

3. Keep concepts separate. 

4. List all countries except U.S.A. 

5. Abbreviations must be written out, 

6. Look at the bibliographic details to see if they add anything relevant. 

7. Formulas and numerical values shall not be listed. 
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EVALUATION OF RETRIEVAL EFFECTIVENESS IN RELATION 
TO USER REQUIREMENTS. 

SOME CASE STUDIES FROM THE COMPUTERIZED INFORMATION 
RETRIEVAL SYSTEM AT THE ROYAL INSTITUTE OF TECHNOLOGY, 
STOCKHOLM. 

Z. Gluchowlcz 

Head of Documentation Department 

Royal Institute of Technology Library, Stockholm 



SYNOPSIS 

The report gives a short description of the computerized 
Information retrieval system -SDI- (Selective Dissemination 
of Information) at the Institute. The user population Is 
presented and the distribution of profiles by subject 
categories Is Indicated. 

The evaluation of the retrieved Information Is based on the 
feedback from users according to the Katter scale. 

The cases studied represent real requests made to the system 
by types of users representing research at the Industrial 
community and at universities. The requests chosen represent 
science-oriented and discipline-oriented queries. The SDI 
system at the Institute has been In operation since 1967. 
About 1200 queries generating about 2000 profiles have been 
run In the system during periods of varying length from six 
months to three years. 
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Introduction . 

Evaluation of effectiveness for information retrieval systems is frequently 
expressed in terms of precision and recall ratio although there is a variety of 
components influencing the effectiveness of information search. 

The precision ratio is generally described as the proportion of relevant items 
retrieved over the total items retrieved and the recall ratio is the proportion 
of relevant items retrieved over the total relevant items in the system. 

Since in real-user environment the user of an operating retrieval information 
system is the only person to make evaluation of relevance, precision and recall 
depend on subjective estimation of relevance. 

Precision figures based on evaluation relevance of real requests made to an 
information system may be. established with a reliable degree of accuracy, while 
recall figures for a big information system can only be estimated hypothetically. 
The present paper deals with retrieval effectiveness evaluated by users in terms 
of relevance. The precision figures are based on the performance of real requests 
submitted to an operative information retrieval system. 

System description . 

Tne Royal Institute of Technology, Stockholm has developed a computerized 
information retrieval system -SDI- Selective Dissemination of Information - 
with the goal to ensure an extensive ready access to documentation in subject 
fields of Interest for the industrial community and for the educational and 
research program at the universities and research Institutes In Sweden. At 
present the SDI system processes 14 different databases, 11 which are externally 
generated and 3 which are Internally generated. (Table 1) The total amount of 
references processed exceeds one million references a year. 

Each database has its own record format designed and developed by the system 
originator. 

The problem of incompatibility of tape Information recorded in different formats 
has been solved at the Institute by the mean of a general program system for 
information retrieval - ABACUS. The system has been developed through the 
Initiative and guidance of the head librarian Bj'drn Tell. (5) 

Profile performance . 

The profile is a list of terms significant to the query which will potentially 
be found In the references. (Table 2) 

Terms or groups of terms are linked together in a logical manner as expressed 
In the search strategy. The required connections between words or groups of 
words are expressed by using Boolean logic, the logical operators being "OR", 
"AND" , "NOT". 

The search strategy may be expressed by one or several alternative logical 
connections, each connection may contain several parameters. 

The symbols for the logical operators used at the Institute are: 

"I" for "OR", for "AND", T for "NOT". 

Profile 87 B (Table 2) Is presented here as an Illustration of a search 
strategy comprising four alternative logical connections. Each connection 
contains one or several parameters which the items retrieved have to satisfy. 
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Profile 87 B 

Subject field: Array antenna design. 
Search strategy: -|E»(A&(2D |C4D)| B&(C| D) ) 

Nr. Grp Search term 

0010 A ANTENNA* 

0020 A APERTUR * 

0030 A AERIAL * 

0040 B ARRAY* 

0050 C PHASE COMPARISON* 

0060 C MONOPULS * 

0070 D EQUIVALEN* 

0080 D PATTERN * 

0090 D SYNTHES * 

0100 D EXCITAT* 

0110 D DIFFEREN* 

0120 D DESIGN* 

0130 D SIDELOBE* 

0140 D DISTRIBUT * 

0150 D PLANAR* 

0160 D GAIIN* 

0170 D TAPER* 

0180 D RAD I AT* 

0190 D COLLI NEAR* 

0200 D COHERENT * 

0210 D DIRECTIVIT * 

0220 D SIDE LOPE* 

0230 D SLOPE* 

0240 D ERROR * 

0250 D CIRCULAR * 

0260 E ENDFIRE* 

0270 E YAGI 

0280 E CYLINDRIC* 

Profile printout 



User Requirements 



* truncation flag 
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Table 2 



V Evaluation of Retrieval Effectiveness 



According to search strategy Ti&(A&(2D| C&D) |B4(C|0)) an Item will be retrieved If 

any one of four following connections Is satisfied: 

1. AS2D 

2. A&C&D 

3. B&C 

4. B&D 

and none of the terms In group E appears In the references. 

Any one of the connections 1-4 will be satisfied If the Item retrieved contains 

one of the following combinations: 

Connection 1: (word 10 OR word 20 OR word 30) AND (word 70 AND word 80 
OR word 71TAN0 word 55 OR ... OFfword 70 AND word 250 OR 
word 80 AND word 90 OR word 80TND word 1 TUTOR ...OR 
word 240~MD word 255J but NOT "(word 260 OR word 2711 OR 
word 280) 

Connection 2: (word 10 OR word 20 OR word 30) ARD_ (word 50 OR word 60) 

AND (wordTO OR wordUO OR ...OR word 250) buFNOT (word 
7E5 OR word 27? OR word 250) 

Connection 3 : word 40 AND (word 50 OR word 60) but NOT (word 260 OR 
word 270D5 word 280) 

Connection 4 : word 40 AND (word 70 OR word 80 OR ...OR word 250) but 
NOT (wor<T?60 OR wor<fT70 OR wonT"280) 

These connections represent 572 alternative possibilities for Items to be 

retrieved. 
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Ueev Requiremento 



Profiles distribution (June 1969) . 
User groups : 



Government and local administration 40 
Government research Institutes 106 
Trade associations and Institutes - 17 
Universities . 193 
Libraries 7 
Private companies 478 
Individual researches 4 

57 ? 



Subject categories : 



Mechanical engineering 179 
Iron, Steel and metal manufacturing 133 
Polymer technology 80 
Chemical technology 77 
Physics 64 
Electrotechnology 59 
Control and computer technology 57 
Reactor technology 40 
Wood, pulp and paper 38 
Telecommunl cation 34 
Transportation, packaging 27 
Business administration 27 
Inventories and storehousing 15 
Food Industry 15 

57 ? 



User-system Interaction . 

in a batch processing information system the user-system Interaction Is 
maintained by means of a dialogue between user and co-ordinator. When 
submitting a query to the SOI system the user has to provide a statement of 
his fields of Interest In natural language In a narrative way giving a detailed 
description of his subject field. He may also provide a draft proposal for his 
search profile. This he can do on the basis of the Profile Design Manual Issued 
by the system author. 

The first stage of a user-system Interaction Is the dialogue between the user 
and the documental 1st at the centre clarifying the scope and nature of the 
query. As the user Is the only person to give a correct statement of the requests 
the statement should preferably be made by himself and not by an Intermedary 
such as a librarian or an Information officer. Therefore the documentallst at 
the SDI centre Insists on maintaining direct contact with the Information 
user. (Table 3) 

The second stage of the user-system Interaction takes place when the profile 
Is being defined. This dialogue Is necessary for selecting search terms for the 
profile and for the decision on the search strategy. 



GENERALIZED FLOW CHART FOR PROFILE CONSTRUCTION 




Truncation of terms 



Construction of search logic, which dcri- 
ncs subject by combining groups of 
scarch te nm using logical operators 



logic; 



Search terms and search logic coded for 
input 



Search terms and search logic punched and 
read into computer 





Manual for profile construction 



Manual for profile input codes 



Profile stored in magnetic tape 
profile store 



I SI Source Tape 
KTH Mcch’fng 
POST 
INSPEC 

Metals Abstracts Index 

Current Index to Conference Papers in 

Engineering. Chemistry & Life Sciences 

COMPENDEX 

Nuclear Science Abstracts 

Abstract bulletin of The Institute 

of Paper Chemistry 

Food Science and Technology Abstracts 
LRIC Master Filet 

Scientific and Technical Aerospace 
Reports 

International Aerospace Abstracts 



Table 3 



User Requirements 

otifiwuwf t» totfiwiin LITTER ATUBLISTA Formulir3 



loadino techniques run multi rwo duct asswqlv line 

f6rM»o/)n»tmiHon 

•ftkprotH nr 87Q •*•»• 

pottadrtM 


konlafct»«r»OA 


ARRAY/ GAIN/ 

BORGIOTTI, g.V. 

ALTA FREQUENZA (ITALY) 


1 



..39, NO.Q 701-205 AUG. 1970 12 B7Q38246 

IMPEDANCE AND GAIN OF A DIPOLE IN AN INFINITE PERIQOIC PHASED 



ANTENNA/ PLANAR/ DIRECT IV IT/ 
FORMAN, B.3. 

RADIO SCI. (USA) 


2 


VOL. 5, NO. 7 1077-83 JULY 1970 

A NOVEL DIRECTIVITY EXPRESSION FOR PLANAR ANTENNA ARRAYS 


B7030254 


ARRAY/ RAOIAT/ 

KILLICK, E.A. 7QLEDZI0WSKI, K.A. PORTER, N.E. 

1989 EUROPEAN MICROWAVE CONFERENCE 


3 


399-402 1970 SEP 1969 

RADIATION OF DUAL POLARISATION FROM LINEAR ARRAYS 


B7O30257 


ARRAY/ DESIGN/ 

HART, G. VIOLA, R. 

1969 EUROPEAN MICROWAVE CONFERENCE 


4 


443-6 1970 SEP 1969 

DESIGN OF L, C AND S BAN) SOLID STATE ARRAY MODULES 


B7D3025Q 


ARRAY/ DESIGN/ 

PATTON, W.T STAIMAN, 0. 

1970 IEEE INTERNATIONAL CONVENTION DIGEST 


5 


390-1 1970 MAR 1970 

INTEGRATED CIRCUITS IN ARRAY DESIGN 


B7D30259 


ARRAY/ DESIGN/ 

ROSENBLATT, A. 

ELECTRONICS (USA) 

wm via Art 


6 



..hj, imu.id /G-tU 20 JULY 1970 B703I 

= ility IS DESIGNED IMT0 DUAL band mooule for mm array 

Table 5 
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Evaluation of Retrieval Effectiveness 



V 



On the basis of the user's statement the documental 1st at the centre compiles the 
list of search terms which are significant to the query and are potential terms 
In the references. As the SDI system at the Institute Is a free-language system 
the exhaustiveness of the terms In the profile Is an Important parameter In 
retrieval efficiency. 

Submitting a query to the system the subscriber may Indicate on the order card 
his requirements for precision and the search strategy Is formulated according 
to these Indications. 

The search output - list of retrieved references - (Table 5) Is delivered to the 
subscriber together with an evaluation form designed according to Katter's six 
grade scale. (Table 4) 

During the experimental period the SDI service at the Institute was free of charge 
and the user was asked to evaluate the Information received. The users' evaluation 
was used as feedback for modifying the profiles In order to obtain precision and 
recall corresponding to the users' needs and judgment. 

At the feedback stage the documental 1st at the centre amends the search strategy 
and/or search terms In order to obtain greater search efficiency. 

Profile adjustment continues the dialogue between the user and documental 1st. 

During the fiscal year 1968-69 a total of 36072 references were evaluated. These 
references have been selected by the SDI system on the basis of Individual pro- ; i 
files, which as of June 30 1969 numbered 845. 



Users' evaluation 


References Percentage 


1 . 


Of inmedlate Interest 


9080 


25.1 


2. 


Of Interest, I have already read It 


2459 


6.8 


3. 


Of Interest, but not fair Immediate use 


11009 


30.6 


4. 


Cannot determine Interest, because the 
citation does not provide enough detail 


1306 


3.6 


5. 


Of no Interest, because material does not 
correspond to what I have described to 
the system. 


11913 


33.0 


6. 


Of no Interest, because my Interests have 
changed since I described them to the system. 


305 


0.9 




3507? 


TOO 



Table 4 

The figures on Table 4 are based on relevance evaluation of real requests In 
real user environment during the free-of-charge period. 
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Veer Requirements 



Experiences . 

When the SDI service at the Institute became subject to an annuai charge ( $60: 
a year) several subscribers continued to evaluate the received information. They 
appreciated these feedback possibilities for amending profiles. 

The users were aware of two significant factors: 

A) The SDI system at the Institute is not a static system, development work is 
continously done with emphasis on programming modifications and system 
techniques. New databases are being included in the system in order to obtain 
more comprehensive coverage in the subject fields of the queries. 

B) The users' fields of Interest is not static either. The priority of certain 
aspects of the subjects of interest may change at various times and the 
profiles have to be modified corresponding to these changes. The staff of the 
system must often review the profiles and suggest appropriate changes ..-The 
documental ists' initiative and suggestions are appreciated especially by 
these users who seldom do anything to their profiles. 

The great majority of users who entered their subscription to the system after 
the free-of-charge period are evaluating the retrieved Information during the 
initial period when the profile has to be amended frequently. During this period 
the staff at the centre is analysing the failures such as wrong correlation of 
terms, overrestrictiveness or broadness of search strategy etc. When the search 
results are satisfactory to the user's need, the user often ceases to make 
evaluation assessments. 

The cases illustrated in this paper were chosen because of the longest duration 
of the evaluation period. They are cited according to the chronological sequence 
that the queries were submitted to the system. Of the 15 cases illustrated, 

7 stem from private industrial firms, marked with the letter "I", 5 stem from 
governmental research institutes, marked with "G"and '3from Universities marked 
with "U". The databases searched in each case are also listed according to the 
chronological sequence in which they have been searched. As the Source Data 
Tapes have been searched since 1968 the biggest number of references in the 
Illustrated cases come from this database. (Table 6) 
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Case 1 . 

Query: Theory of roller bearings. Theory of lubrication. (1) 
Evaluation period: 9.12.1960 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u 
2 


a t e d 
3 


4 


5 


9.12.1968 


1S1 


271 


214 


49 


49 


115 


- 


1 


11.2.1969 


MECH-ENG 


65 


58 


10 


18 


30 


- 




30.1 .1970 


METADEX 


35 


29 


7 


3 


19 


- 




16.10.1970 


COMPENDEX 


27 


22 


3 


12 


7 


- 




total 




398 


323 


69 


82 


171 


- 


1 


percen- 

tage 








21,5 


25,5 


53,0 


- 





% 



0> 

U 

c 

07 

u 

07 



*0 

07 



> 

07 



fO 

+-» 

O 



07 

CT> 



C 

07 

O 

u 

07 

Qu 



100 

90. 

80 

70 

60. 

50. 

40 

30 

20 

10 



3 



2 



1 



relevant Irrelevant 
1+243 4+5 

(see table above) 



relevance rating 



i 

i 



AU2 



48 



Case 2 . 



Query: Nobile radio comnunicttion equipment (I) 
Evaluation period: 11 .2.1969 - 3.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u 
2 


a t e d 

3 


4 


5 


11.2.1969 


ISI 


32 


24 


4 


4 


7 


- 


9 


29.12.1969 


INSPEC 


SO 


35 


8 


4 


8 


1 


14 


28.10.1970 


COMPENDEX 


4 


4 


- 


1 


1 


- 


2 




















total 




85 


63 


12 


9 


16 


1 


25 


percen- 

tage 








7,9,1 


14,3 


25,4 


M 


39,6 



X 



100 




1 + 2+3 4+5 

(see table above) 

relevance rating 
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Case 3 • 



Query: Noise generated by hydrodynamic devices (G) 



Evaluation period: 21.2.1969 - 3.2.1971 



database v 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u 
2 


a t e d 
3 


4 


9 


21.2.1969 


ISI 


84 


45 


1 


8 


21 


5 


10 


16.1.1970 


INSPEC 


70 


44 


4 


9 


19 


2 


10 






































total 




154 


89 


5 


17 


40 


7 


20 


percen- 

tage 








5,6 


19,1 


45,0 


7.9 


22,4 



X 

100 




(see table above) 
relevance rating 




m 
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Cast 4. 



Qu«ry: Fluidic dtvlcas (6) 
Evaluation period: lfi.4.1969 - 5.2.1971 



database 


n unbar of rafarancas 


first 

starch 


nano of 
tape 


ratrla- 

vod 


total 


a y 

1 


( 1 VI 
2 


i t a d 
3 


4 


5 


15.5. 1969 


IS! 


315 


42 


20 


11 • 


3 . 


3 


5 


16.4.1969 


MECH-EN6 


174 


44 


* 

17 


9 


3 


6 


9 


19.12.1969 


MSPEC 


279 


153 


57 


34 


3 


21 


36 




















total 




760 


239 


94 


54 


9 


30 


52 


888"- 








39.3 


22.6 


3.9 


12.6 


21.7 



f 

100 




1+2+3 4+5 : 

(saa table a boys) 

ralavanca rating .... 
405 
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Case 5. 



Query: Calculation of electrical power networks (I) 
Evaluation period: 2.5.1969 - 5.2.1970 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u « 

2 


a t e d 

3 


4 


5 


2.5.1969 


ISI 


290 


245 


52 


13 


97 


6 


77 


29.5.1969 


MECH-ENG 


29 


14 


- 


- 


7 


- 


7 


18.12.1969 


INSPEC 


645 


489 


80 


7 


201 


8 


193 


16.10.1969 


COMPENDEX 


56 


28 


3 


6 


5 


- 


14 


total 




1020 


776 


135 


26 


310 


14 


291 


percen- 

tage 








17,4 


3,3 


40,0 


1.8 


37,5 



* 

100 




relevant irrelevant 

1+2+3 4+5 

(see table above) 



relevance rating 

406 



52 



Query: Cereal -based foodstuffs (i) 
Evaluation period: 2.5.1969 - 21.12.1970 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


i t e d 

3 


4 


5 


2.5.1969 


ISI 


504 


491 


147 


3 


191 


18 


132 


16.10.1970 


COMPENDEX 


3 


3 


- 


- 


- 


- 


3 






































total 




507 


494 


147 


3 


191 


18 


135 


percen- 

tage 








29,( 


1 0,6 


38,6 


3.6 


27,4 



X 

100 




1 + 2+3 4+5 

(see table above) 

relevance rating 
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Case 7 • 

Query: Compressors (I) 

Evaluation period: 27.5.1969 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


i t e d 

3 


4 


5 


30.5.1969 


ISI 


276 


219 


50 


• 


57 


4 


109 


27.5.1969 


MECH-ENG 


247 


232 


151 


4 


52 


2 


23 


28.2.1970 


1NSPEC 


50 


50 


6 


m 


14 


- 


30 


16.10.1970 


COHPENOEX 


50 


39 


6 


m 


17 


m 


16 


total 




623 


540 


213 


4 


140 


6 


177 


percen- 

tage 








39.5 


0,7 


25,9 


1.1 


92.1 



t 

100 




(see table above) 
relevance rating 

408 
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Case 8. 

Query: Technical forecasting on the development of electronics (G) 



Evaluation period: 13.8.1969 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


a t e d 
3 


4 


5 


13.8.1969 


ISI 


206 


179 


12 


6 


14 


1 


146 


8.12.1969 


INSPEC 


427 


412 


50 


14 


21 


1 


326 


16.10.1970 


COHPENOEX 


22 


13 


1 


- 


1 


- 


11 




















total 




655 


604 


63 


20 


36 


2 


483 


percen- 

tage 








10,4 


3,3 


6,0 


0,3 


80,0 



X 

100 




1+2+3 4+5 

(see table above) 

relevance rating 
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55 



Case 9. 

Query: Pumps and pumping (I) 
Evaluation period: 19.8.1969 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


a t e d 
3 


4 


5 


20.8.1969 


ISI 


435 


393 


55 


12 


66 


17 


243 


19.8.1969 


NECH-EN6 


492 


419 


63 


4 


115 


5 


232 


16.10.1970 


COHPENDEX 


33 


17 


7 


• 


5 


. 


S 




















total 




960 


829 


125 


16 


186 


22 


480 


percen- 

tage 








15,1 


1.9 


22,4 


2,6 


58,0 



i 

100 




(see table above) 
relevance rating 
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Case 10. 

Query: Chlorinated hydrocarbons In different organisms (U) 
Evaluation period: 25.8.1969 - 23.1.1971 



database / 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


a t e d 
3 


4 


5 


25.8.1969 


1SI 


739 


685 


192 


26 


460 


5 


2 


8.9.1969 


POST-P 


56 


43 


4 


- 


13 


20 


6 


26.9.1969 


POST-J 


24 


17 


5 


- 


7 


5 


- 




















total 




819 


745 


201 


26 


480 


30 


8 


percen- 

tage 








27,0 


3,5 


64,4 


4.0 


i.i 



X 

100 




1+2+3 4+5 

(sea table above) 



relevance rating 



Case 11. 

Query: Nickel-Iron batteries, iron electrodes (u) 
Evaluation period: 3.9.1969 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u 
2 


i ted 

3 


4 


5 


3.9.1969 


ISI 


50 


22 


17 


2 


2 


m 


1 


6.9.1969 


MECH-ENG 


8 


8 


2 


- 


5 


m . 


1 


30.1.1970 


METADEX 


30 


26 


5 


2 


18 


• 


1 


9.3.1970 


1NSPEC 


6 


5 


1 


• 


4 


• 


m 


total 




94 


61 


25 


4 


29 


- 


3 


percen- 

tage 








41,0 


6.6 


47,5 


m 


4,9 



t 

100 




(see table above) 
relevance rating 
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Query: Aerials (g) 

Evaluation period: 6.9.1969 - 5.2.1971 



database 


number of references 


first 


name of 


retrie- 




e v 


a 1 u i 


a t e d 






search 


tape 


ved 


total 


1 


2 


3 


4 


5 


6.9.1969 


IS! 


356 


332 


12 


52 


112 


- 


156 


29.12.1969 


INSPEC 


53 


40 


9 


12 


18 




1 


16.10.1970 


COMPENDEX 


6 


4 


- 


1 


3 


- 


- 




















total 




415 


376 


21 


65 


133 


- 


157 


percen- 

tage 








5,6 


17.3 


35.3 


- 


41.8 



% 

100 




1+2+3 4+5 

(see table above) 

relevance rating 



\ 



Case 13. 

Query: Optimization of preventive maintenance (I) 



Evaluation period: 7.11.1969 - 5.2.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u 

2 


9 ted 
3 


4 


5 


12.11.1969 


1SI 


107 


99 


73 


3 


17 


3 


3 


7.11.1969 


MECH-ENG 


79 


71 


43 


6 


20 


2 


• 


28.2.1970 


INSPEC 


88 


87 


65 


3 


15 


i 


3 


16.10.1970 


COMPENDEX 


20 


11 


10 


- 


1 


- 


- 


total 




294 


268 


191 


12 


53 


6 


6 


percen- 

tage 








71 ,3 


4.5 


19,8 


2.2 


2.2 



% 

100 




(see table above) 
relevance rating 
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Case 14. 

Query: Information on radiation protection (6) 



Evaluation period: 30.3.1970 - 23.1.1971 



database 


number of references 


first 

search 


name of 
tape 


o 

retrie- 

ved 


total 


e v 

1 


a 1 u i 

2 


a t e d 
3 


4 


5 


30.3.1970 


ISI 


695 


642 


139 


47 


74 


10 


372 


1.4.1970 


INSPEC 


201 


247 


70 


23 


17 


4 


133 


26.5.1970 


NSA 


575 


436 


115 


30 


83 


- 


208 




















total 




1551 


1325 


324 


100 


174 


14 


713 


percen- 

tage 








24,4 


7,5 


13,1 


1,1 


53,9 



i 

100 




1+2+3 4+5 

(see table above) 

relevance rating 



415 



61 



Case 15. 

Query: Influence of mineral nutrients on morphogenesis (U) 

Evaluation period: 2.6.1970 - 23.1.1971 



database 


number of references 


first 

search 


name of 
tape 


retrie- 

ved 


total 


e v 

1 


a 1 u i 
2 


a t e d 

3 . 


4 


5 


2.6.1970 


ISI 


1032 


1032 


44 


1 


245 


29 


713 
























































total 




1032 


1032 


44 


1 


245 


29 


713 


percen- 

tage 








4,3 


0,1 


23,8 


2.8 


69,0 



% 

100 




(see table above) 
relevance rating 
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1 . 10 " 



Number of references processed per year as fun 



:tion of time 



9.10 s 



8.10 



7.10 5 - 



6.10 s 



5.10 ■ 



s 



r 



data- 

base 



CICP 

ERIC 

MOOD 



CAC 



IPCAB . 



COMPEN 
DEX 



NSA 



NYFLI . 
MA 



INSPEC 



POST 



4.10 s 



I960 



1969 



1970 



1971 



MECH- 

ENG 



year ISI 



Table 6 
417 

/A I 



1 » 1 r- 



6 3 



o 



V Evaluation of Retrieval Effectiveness 



Conclusions . 

According to Heaps (8) the effectiveness of mechanised information retrieval system 
expressed in terms of relevance may be represented as a function of question, 
estimate of relevance and database. 

Effectiveness = F (question, estimate of relevance, database) 

This functional definition of effectiveness is consistent with our three years of 
experience with the SDI system at the Institute. 

The user's information need, his statement of request, ,his relevance requirements 
and relevance judgments, his tolerance of irrelevance makes him a significant 
parameter of this functional definition of effectiveness. The system exists for 
the user and in order to satisfy the user's need. The databases have to be chosen 
with the goal of covering the fields of interest of the queries submitted to the 
system. 

In the SDI system at the Institute most of the profiles are searched on several 
databases, because the queries are more problem-oriented than discipline-oriented. 
The distribution of profiles among the databases as of December 1970 is shown 
in the table below. 



Number of profiles 



Number of databases searched 



62 1 

58 2 

63 3 

201 4 

187 5 

32 6 

4 7 



Although the system is based on a multiple data file, there was no significant 
overlap among the data bases. Infact they tend to be complementary by handling 
different aspects of subjects. 

The effectiveness of the search results in suitable databases has depended in 
a very high degree on the active interest of the subscriber. The user with 
experience and training in information work or with some knowledge of the basic 
principles of a mechanised information retrieval system will have a more 
meaningful dialogue with the system. He takes a more active part in the develop- 
ment of his profile and his requirements of precision and recall are easier to 
satisfy because they are more realistic. According to our experience scientists 
working with basic research are more inclined to use a broader search strategy, 
they are Interested in browsing through greater amount of references and there- 
fore are more tolerant of a higher percentage of Irrelevance, (case 8) 

The engineer from the industrial community prefers a more restrictive search 
strategy. He requires higher precision even when he knows that he may miss an 
amount of relevant Information not retrieved. He tolerates this omission 
because his reading time is limited to a few references and he does not have 
the possibility to retrieve these items by a manual search in a large database. 
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User Requirements 

Profiles giving search results with 90% or 100% relevance (case 1) very rarely 
occur. The overage precision in the system as evaluated by the users is ranging 
from 40% - 65%. 

The increased attention to the user as significant part of an information retrieval 
system has stimulated various kinds of user studies. 

This paper does not give an evaluation study or a user study as such. Such 
statistical study demands a great amount of evaluation assessments from the users 
and a continuous feedback analysis. 

A computerized system designed to measure retrieval effectiveness in terms of a 
real user behaviour, gathering and maintaining data for different decision 
functions would be a desirable tool for a reliable study. With the increasing 
implementation of on-line techniques transactions files for such studies will be 
easier to create and maintain. 
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CYBERNETIC ANALYSIS OF COMMUNICATION SYSTEMS 



Patrick R. Penland and James G. Vi 1 Hams 
Graduate School of Library and Information Sciences 
University of Pittsburgh 



SYNOPSIS 

There is a pressing need for research to test the cybernetic model 
of communication. This need is met in part by an experimental 
conmunity encounter system, where the entropy reducing components 
of the cybernetic model operate in a simulated social system. This 
system hrs been developed at the Graduate School of Library and In- 
formation Sciences, University of Pittsburgh for research and in- 
structional purposes. Demographic, land use and legal data of 
Allegheny County (metropolitan Pittsburgh) serve as a cold-start 
constraints before fifteen weekly cycles carry the encounter sys- 
tem fifteen years into the "future.” The on-line/off-line model 
is cybernetic in terms of its response to in-put (man or machine) 
and can cause new interpretations of the response by those receiv- 
ing the input for the next cycle (man or machine). The encounter 
model of decision making involves information handling specialists 
in the organizational and interpersonal effects of information 
transfer in governmental, management and conmunity decisions, and 
provides for the collection and analysis of data on a number of 
conmunications and sociological variables for research purposes. 

Introduction: 

Since communication is the process of creating situations (dyad, group, cotnnu- 
nity) in which meanings can be engendered in people (15), then considerations of 
community context cannot be ignored by those in the conmunications profession. In 
general, it is the function of the disciplines to create new knowledge (largely for 
its own sake). On the other hand, the professions endeavor to make knowledge kinet- 
ic (information surprise) in the lives of people and call attention to lacunae in 
knowledge which if researched and created by the disciplines, would be socially 
useful (18). Without this symbiotic relationship, the disciplines become overly 
academic and the professions waive any leadership role for civic betterment and 
social accountability (19). 

Until recently, no underlying theory of the societal enterprise existed which 
was not a static model. Even the model proposed by Hall (8), whole concept of 
"awareness” is particularly insightful, cannot easily be related to cybernetic 
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theory (17). There exists however the socio-drama model of Burke-Duncan (10) which 
can account for the cybernetic interaction of community endeavor. The rules of the 
social enterprise constitute the behavioral outcomes of the roles played by adapt- 
ive control organisms as they react cybe metical ly to the perceptions of latent 
and even explicit social entropy. 

The adaptive control organism has a built-in and dynamically creative antip- 
athy to entropy. Necessity may be the mother of invention, but the imperative to 
escape entropy is the critical social matrix of significant symbol. Men demand 
structure almost more than bread as the reasoned and concluding statement of 
Brinton in his Ideas and Men (4) would indicate or as the pessimistic pronounce- 
ments of Toffler (20) would in a negative way try to panic the social enterprise. 

Granting that men demand structure, the tragedy has been that most social the- 
oreticians have proceeded in their analyses upon the basis of mechanistic princi- 
ples (2). However an alternative position has been developed which cybernetically 
shapes the referents of the symbolic enterprise (9). The cybernetic structuring 
processing of the symbol releases participants from both the mechanisms of the 
mores as well as the Berkeleian solipsism. This model has been embedded In the 
community encounter system developed at the University of Pittsburgh with both on- 
line and off-line role components. 

In the coraminity encounter system, constraints are abstracted from societal 
patterns and operate within the democratic rules of gaming procedure. Participants 
deploy themselves according to models of conmunicat ion, employing sociodrama for 
its function of guiding social criticism and information surprise as the motive for 
change. While the inclination of powerful elements in the social order is towards 
criticism only, information supply is positive and developmental, encouraging other 
participants towards a "profiles of courage" approach. 

In order to realize its cybernetic potential, comunication science must in 
Duncan's words "take into account what was communicated, the situation in which 
the conmunicat ion occured, the kind of person who communicated (i.e. his role), the 
means he used, and the social purpose of his communication" (10:17). Duncan goes 
on to develop his thesis in the explication of several axiomatic, theoretical and 
methodological propositions. These are the propositions of sociodrama theory which 
may be refined as they are verified through research design. It is one thing to 
talk about research design in the sense of traditional social science (13). It is 
quite qnother to determine and assemble the research contexts wherein theory can 
be verified, let alone establish experimental vehicles for the testing of hypoth- 
esis (9). 

Community Encounter System: 

There is a need to know more about the communicative abilities of humans as 
well as the means for predicting such ability based upon experimental techniques 
which minimize error and maximise yield of information. The convnunity encounter 
system has been developed to study an on-going simulated environment which faci- 
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lit a tea research on the problems and component relationships of conmunication. The 
encounter system provides for a gestalt of nodal activity wherein the entire ga- 
mut of human conmunication esn be studied. The encounter simulation provides a 
central focus in the development and evaluation of new programs and methods as well 
ss the social justification of existing ones. It also focuses attention on the 
need for long range systematic research as a basis for planning situations wherein 
effective conmunication will be engendered. 

For example, on an elementary level, videotapes and the computer have already 
proved much more effective in recording complex behavior for measurement purposes 
than the traditional pencil and paper methods. Of course the "cameramen” are under 
control of at least one researcher to ensure that its "selective eye” follows the 
"sense” of the dynamic encounter situation being recorded. The computer monitors 
the input of each role participant, provides print-outs of individual and group 
performance as well as statistical analyses and updates the constraints for each 
succeeding weekly (annual) cycle. 

The encounter simulation is a unified model of communication based on a very 
few principles which are available in cybernetics. Cybernetics in its assumption 
and principles is sufficiently general to consider it a unified approach to commu- 
nication, It can be a powerful tool for conmunication analysis. Homeostasis is 
not simply a response to a stimulus, but a process occuring in all parts of the 
encounter system and results from a transactional process among ever extending 
sub-systems. Cybernetic analysis can be used to diagnose interpersonal relations 
and categorize such transactional behavior as the communication components which 
accompany the self-stabilizing interactions of the entire encounter system. The 
tra^actional process requires conmunication of information and motives at many 
levels. Information processing can be related to the activities of learning, 
thinking and understanding. 

Based upon previous knowledge as codified in the social sciences ( 11 ) and in 
particular information science ( 14 ) a number of assumptions underlie the research 
vehicle. Principally among these assumptions are the following: 

People are willing to pay for information as a social resource to 
the degree that it is perceived as an essential ingredient in 
decision making. 

The degree of consensus underlying the decision making process as 
represented on the print-out varies directly as the encounter simu- 
lation progresses. 

The number of people involved in decision making varies directly 
with the range of contacts in the conmunication system. * 

People in encounter role playing situations react in a manner 
isomorphic with real life. 
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The function of criticism keeps the social system under control 
in terms of coranunity objectives. 

Power structure response to continuous citizen criticism follows 
an ordinal distribution: (a) agreement and well-wishing; (b) 

coirmlttees and/or commissions are appointed; (c) economic and/or 
legal action occurs as a result of committee or cocnnlsslon reports* 

The advantage of cybernetic analysis is that the model transcends subject 
boundaries, and views people and institutions as adaptive control systems of re- 
ception, transmission, evaluation, and storage* Input la perception; judgement 1 b 
decision-making; and output is behavioral action* The findings of the newer "inter- 
dl8Clpllnary M social sciences have been synthesized in order to explicate the three 
components of the cybernetic model. This encounter model has been developed to re- 
place the verbal speculation of the older social sciences with the on-line and off- 
line cybernetic components of reception, decision-making and behavioral patterns. 

The teleological thrust of the of f-lin j roles is toward cooperative or compet- 
itive behavior for valued ends by employing the entropy reducing functions of cy- 
bernetics for research purposes. The findings of each of the newer social sciences 
shed light upon, and delineate research matrices for the functions of the three 
components of an adaptive control organism. The following table which 1 b somewhat 
more abstract than Figure 1, indicates the cybernetic components of the community 
encounter system which has been developed for research and instructional purposes: 



Cybernetic Model Components 



Perceptor 



Interpretator 

Hole-structure (off- 
and on-line) 



Effector 



Organized and pro- 
jected information 



On-line Data Matrix 



Mass Convnunlcation 



Cognitive structure: 



Land use 



Organizations 



organized information 
space 



Financial 

constraints 



Talking chains 



Affective structure: 



Legal constraints 



power 



Demographic 

constraints 



government 
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The encounter vehicle has been designed to explore a number of theoretical 
considerations. Among the propositions for which measurement devices have been 
initially developed are the following: 

The understanding of participants towards their roles varies directly 
with increased information and convnunicative contacts. 

Social power varies directly with the ability to supply information 
for decision making and with the number of convnunicative contacts. 

The complexity of decicion making processes varies directly with 
the amount of information available and the number of convnunicative 
contacts. 

The explication of social issues and problems is a function of 
information. 

Issue awareness and problem identification varies directly with 
citizen involvement and monitoring of governmental decision making. 

The first component of the cybernetic model, stimulus and reception, has been 
explored in the disciplines of information theory, linguistics, sign and symbol 
behavior which together lay a foundation for the analysis of information surprise. 
From a cybernetic system-model point of view there ore two basic purposes for 
convnunication : to inform and to motivate. Convnunication functions to alter a 

receiver's concepts and to change his preference and feelings so that a wider range 
of alternatives exist for decision-making and behavioral response. This has the 
effect of increasing the complexity of decision making as well as extending the 
time Interval between problem identification avad behavioral action. In the con- 
struction of a message, It is necessary to build In informational and especially 
motivational referrents (12). However, according to the sociodrama model of convnu- 
nicstlon (10) and the experience of the mass media in persuasive convnui.ication, 
motives are changed first and then concepts are filled in. 

Human behavior is the output of a controlled, not an uncontrolled system. 

This Viewpoint is consistent with the analytical concepts used in convnunication, 
decision-making snd systems analysis whi?h are considerations in the selector and 
effector stages of the cybernetic model. In addition, they are similar to the in- 
formational and motivational aspects of convnunication, to the opportunity and pre- 
ference functions of decision-making, and to the broader cultural notions of scien- 
tific and value judgements in society. Interpersonal relationships and transac- 
tional analysis in group, organisation and coitvnunity endeavor are means of enlarg- 
ing opportunities directly both in concept effectiveness and preference strength. 

Human behavior is based on information about things, not necessarily on the 
nature of things. The human is able to accumulate reservoirs of information over 
s period of time through s process of symbollsation or conceptualisation. Concepts 
can be of things as well ss of events. Once information is stored in the form of 
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concepts, it can be tapped, or released, by cues. No action or cue can occur at a 
distance without some connection either of a cause-effect or a feedback relation- 
ship. The cause-effect action, no matter how complex, proceeds in one direction 
and is corrmon in the natural environment. A feedback system, however, involves re- 
ciprocal relations which can be studied by the multi-correlational analysis of many 
variables* Correlation can not establish cause effect relations, but only show 
that variables co-vary. Correlation is very useful in exploring an environment to 
determine what variables should be brought under controlled study using other clas- 
sical methods to show cause-effect. 

Freedom of opportunity varies directly with an increase in number of informa- 
tion sources and depends on access to ccmvnunication media. Mass media constitute 
a major stimulus source for social endeavor as well as the conrmnity encounter sim- 
ulation. In order to maintain a bargaining advantage, the flow of information can 
be curtailed about the matters under bargaining* Where information cannot be con- 
trolled directly, perception may be modified by criticizing co*Tmunication content 
or by raising compelling points of view against it. In any event, organized in- 
formation space (library as prototype) is considered to be of surprise value as 
well as serving as an entropy -reducing function, whether the "beings of reason" in- 
cluded were developed in the past or are emerging at the present moment. 

The second component, the preferential phenomena of interpretation, is explor- 
ed in the disciplines of game theory, decision theory, value philosophy and psy- 
chology which taken together help to explicate the preferential patterns of the 
adaptive control organism. Descriptive and normative studies of decision-making 
have led to the analysis of aspects of human behavior in which choices are made 
among alternatives. For example, Simon has pursued the ramifications of decision- 
making and value theory in his Models of Man (6). 

By way of illustrating the availability of choice among alternatives, the 
following changes in inputs which vary directly with outputs have been adapted from 
Cadwallader (9): (a) the innovation rate is a function of the legal and economic 

rules which organize the problem solving matrix of the social system; (b) the range 
of communicative contexts varies directly with the available variety of information; 
(c) comnunication capacity cannot exceed the quantity and variety of information 
supply; (d) the sociodrama model provides the rules (criticism) for forgetting or 
disrupting the highly probable organizational patterns of the power structure. 

Any cybernetic system has both input and output. The effect of the environ- 
ment on the system is input, while output is the effect of the system on the en- 
vironment. Within the system, the effect of one component upon another is a func- 
tional relation or interaction* In a cybernetic system, the governor or selector 
is set by the larger or encompassing system. For example, the value system of a 
human individual is not calibrated by the individual alone, but mainly by the cul- 
tural system. The setting of the governor or the establishing of values for con- 
trolling behavior is a parameter of the cybernetic system determined by the larger 
environment and during the traditional experimental study of a system is supposed 
to remain unchanged. However, in the correlation analysis of nonparametfic dis- 
tributions, it is not ossential to hold independent variables under rigorous 
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There is one area of cormunications research to which a program of investi- 
gation in the encounter simulation leads directly. Professional education Is based 
upon an academic grading system which, while it may be pertinent to a degree pro- 
gram, does not necessarily ensure role definition and development. The encounter 
simulation creates situations in "real-life" standards where effective comninica- 
tive behavior can be developed in context. In addition to evaluation which re- 
flects content gain or progress, role definition and development can be evaluated 
in relation to defined group norms. 

Actually from the studies already undertaken there is a reason to believe that 
research employing canonical correlations in the encounter simulation can be as 
productive as investigations carried out in highly organised experimental frame- 
works. "Value free" distributions may be logically desirable but in consnunl cat ions 
research are difficult to achieve. The advantages of research in the encounter 
simulation lie not so much in the degree to which the findings are rigorously veri- 
fied, but rather in the large repertoire of types of studies that become possible 
and in the built-in stimuli to inquiry. Because of the nature of the encounter 
simulation, the faculty and participants have the combination of energy, Insight 
and integrative effort that is necessary for constructive inquiry about fundamen- 
tal problems (see Figure 4). 

Before experimental research can be undertaken in social comnunications, many 
studies need to be conducted in contexts employing statistical controls rather 
than experimental controls. The distribution of variables must be identified and 
compared not only with one another but also with the characteristics of the com- 
ponents and subcomponents of the encounter system. Instead of specially selected 
subjects who might fit more readily into traditional experimental design, subjects 
are the students who register in the communications sequence. Considerable pro- 
gress has been made in the development of evaluative instruments, including de- 
vices which as final instruments in one phase of evaluation can be used as Initial 
measures at the beginning of another. 

After all, there are really few indispensable elements in research design that 
cannot be conducted in the encounter system. The instructional personnel respon- 
sible for the library -community encounter simulation have a research point of view 
and possess knowledge of appropriate experimental and statistical techniques. An 
office of tests and measurements has readily available not only instrument to 
measure achievement as well as participant characteristics, but also has the re- 
sources and the means for test development, administration, scoring and statistical 
analyses. 

Except for the randomization of treatment groups, many experimental controls 
can be applied after the observations are collected. Groups can be stratified or 
matched after the experiment is completed just as easily as before. Whenever the 
original groups are large enough to permit fractionalisation, specifications can be 
imposed on the groups during the analyses. Computers esn be programmed to accom- 
plish this readily. In the matrix of intercorrelations and weightings, the unique 
contributions of any of the independent variables can be determined by factor an- 
alysis. Any variable can be interchanged as to, whether it is to be used as a con- 



428 



Comun icat ion Systems 



trol or as a predictor* A system of experimental inquiries will generate valuable 
results without designing a new experiment for each new set of conditions. 

Older social science methods of research have been evaluated in relation to 
components of the encounter system and to characteristics of senders and receivers* 
In addition, new methods and devices not only are being invented but they also are 
being evaluated in relation to "established 11 procedures. New methods can be eval- 
uated relatively easily without radical disruption of the encounter simulation. 

For example, subject groups need not necessarily be matched. Nor do they have to 
be of the same size "or even be concurrent. It is however necessary that the "con- 
trol" groups be measured for initial level of skill or knowledge to be communicated 
as well as for all the characteristics under consideration as referents. 



Experimental control as employed in the social sciences with human subjects 
uses a limited range of one or two variables. The subjects may be selected to 
yield predetermined distributions, or members of like paira, or triads, or quartets 
may be assigned to treatment groups, usually at random. Treatment is the indepen- 
dent variable, residual gain the dependent. However, other variables may be used 
for control. Correlational analysis or canonical correlation permits the use of 
conventional significance tests as to whether changes in the skill or variable un- 
der study are related to method. Also through analysis of variance the practical 
importance of changes may be assessed to better advantage than in the usual ex- 
perimental designs. 

Summary : 

Organization adds to the transactional model of conmunication but one element 
of structured cooperation which Includes the development of goals and the shared 
experience of values or motives. With organization comes the complications of in- 
creased roles but also of relationships among the sponsors or owners (if a business 
or industry), the staff and the recipients. The whole society is an organization. 
Government is the formal organization of power. Property and legal rights consti- 
tute the set of rules about transactions. Whether the awareness of them is in- 
formal, formal or technical (4), transactions among citizens are constantly going 
on and continuously being formalized into "permanent" coalations. For government 
not to intervene in the transactions of power groups is to sanction those trans- 
actions. The scope of government is total for its society since its actions and 
inactions cover all possible situations. 

The adaptive control organism is most productive when the cognitive and affec- 
tive domain work together in a complementary manner. However such a harmonous re* 
lationship is neither always the case nor possible. Be this as it may, both realms 
are essential components of the conmunity encounter system. Figures 2 and 3 de- 
lineate this cooperative vs. competitive interaction which is the matrix of the 
social enterprise. The encounter system is composed of participant roles and the 
first line of research has focused on the* entropy-reducing functions of cocmunica- 
tion (information) agents* This depends not only on the manner in which informat- 
ion space is organized but also and especially on the way in which information la 
employed to cause surprise in receivers. 
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Many specialized information processing canters are available in metropolis, 
but there is a serious lack of coordination. There la no one clearinghouse to 
which to turn. The discrepancy which exists between need and the mustering of in- 
formation and of making it kinetic in the affairs of citizens la approaching a cri- 
sis. When left unattended, it is interesting that participants begin to speculate 
on the contributory affect of a lack of available information upon the urban dis- 
order in the simulated encounter environment. 

Information networking and library planning is an extension of the surprise 
value of information retrieval. Networks and inter library cooperation require the 
combination of materials, services and professional expertise which is impossible 
to achieve by one library alone. Information networking adds the dimension of an 
effective switching center at the regional, state and national levels. 

Libraries in the simulated metropolis work together through regional library 
centers in order to avoid unnecessary duplication of resource collections and to 
facilitate access both to their own resources and to other libraries in the nation- 
al network. Numerous other information centers exist which function to provide 
significant services to specialized interests. All of these benefit in varying de- 
grees from the information transfer (bibliographic) network. Each of these may 
have other needs for Information which primarily cannot be met from documents that 
stetu from the usual publishing and other distribution channels. 

A good amount of knowledge, particularly that listed to the work-a-day life 
of the people, exists in records and sources that are ephemeral and not highly or- 
ganized, or perhaps remain unorganized. This type of knowledge is often mission- 
oriented. Information la retrieved from such sources only ao long as there is an 
obvious and continuing need, usually for a limited clientele such aa a housing pro- 
ject or a neighborhood activist cell. There la another extensive category of know- 
ledge which does not exist in records at all, but la available only in the minds of 
specialists and professional experts. This kind of knowledge becomes available on- 
ly upon consultation. The information la rendered kinetic when the individual in- 
terfaces with a consulting expert over an inmedlate problem or specific interests 
such aa Information Volunteers Services of the Health and Welfare Association of 
Allegheny County. 

Finally there is the knowledge of the present moment, the near past and the im- 
pending future where the emerging needs of the people help to shape the information 
sources of the present moment. There are several media of conmunlcetlon in metro- 
polis whose knowledge of the present, the near past and the inmedlate future la a 
constant and continuous source of information which few citizens in the area could 
avoid even if they wanted to do so. Most people's requirements are conditioned 
partially by the media and partly by their daily occupations. Consequently the in- 
formation spaces sought, range widely over ephemeral and mission-oriented sources 
of knowledge. Only a small percentage of the information required every day is 
sought in the depth for which the bibliographic record la designed to serve. Of 
course without such Inquiry from the record, however small in quantity, civilisation 
would flounder for lack of perspective. In the long run, libraries of the record 
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ere an integral element of modem vigorous civilizations, but in the day-to-day 
preoccupations of people they often seem Irrelevant. 

Consequently a conmunlty-encounter simulated social system has been developed 
by the Graduate School of Library and Information Science, of the University of 
Pittsburgh. The encounter simulation has been built upon n cybernetic model of 
contnunlcatlon for research and instructional purposes. The power -deployment of 
government and the surprise value of organized information space are the central 
entropy-reducing components of the model being tested. Social science research 
methods and distribution-free statistical techniques have been used to study re- 
lations of variables in the in-put and out-put phases within and a:.iong each other 
as well as with characteristics of the entropy-reducing components. 
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A GENERALIZED THEORY FOR THE EFFECTIVENESS 
AND UTILIZATION OF INFORMATION* 



M. C. YOVITS 

Department of Computer and Information Science 
The Ohio State University 
Columbus, Ohio 43210 
U.S.A. 



SYNOPSIS 

A general theory Is presented which defines the term ’Information" 
as "data of value In decision making”. Decisions In turn result 
In courses of action which In turn result In actions that can be 
observed or measured In the physical world. A generalized theory 
Is suggested which establishes relationships and procedures for 
the flow of Information. Analytical relationships are suggested 
for describing the effectiveness of Information as a function of 
resulting observable actions. 

Introduction 

One of the major problems that exists In the development of Information storage and 
retrieval systems Is that of developing methods of evaluation for these systems. 
Only when such methods have been established will It be possible to compare 
various systems as to their effectiveness and then ultimately to attempt to decide 
on the value to bo placed on such a system. Only when effective criteria for 
evaluation of Information systems are available will It be possible quantitatively 
to determine whether It Is worthwhile to develop one system rather than another 
or even whether It Is desirable to develop any system at all . 



*Resoarch sponsored In part by the National Sclonce Foundation through Grant No, 
GN-534 from the Office of Science Information Service to the Computer and 
Information Science Research Center, Tho Ohio State University. 



V Evaluation of RotvUval Effeativenooe 

Naturally In order to establish specific measures of effectiveness for Information 
systems It Is necessary to develop quantitative measures of Information and to 
relate the Information quantitatively to some measuro for the utilization of Informa-* 
tlon. At tho presont time any measure for the effectiveness of Information Is 
generally limited to the rather general, qualitative question of relovance to a user 
or group of users. Tho question of what rolovance may mean or of how relevant 
information may be Is normally not considered. The question of what value can be 
placed on a particular measure of relevance is not generally considered. 

In the present paper a theoretical treatment which enables the establishment of 
quantitative measures for the effectiveness and utilization of information Is 
presented. Some of this work has been presented In earlier papers and Is accord- 
ingly summarized here. Suggestions for extending the previous work aro given In 
order to establish relationships which enable the evaluation and comparison of 
information systems. 

Background 

The term information has many different connotations, uses, and meanings. It Is 
sometimes used In tho rather specific sense that Shannon and Weaver (1) have 
established. In this sense, although Information and Its properties and appropriate 
analytical relationships are well defined and quantitatively measured, the class of 
situations which is covered Is a small part of those of Interest. In tho situation 
considered by Shannon and Weaver the context of the message Is of no significance 
and the theory Is concomed with the probability of the receipt of any particular 
message for various conditions of the transmission system. 

On the other hand, Information Is frequently, perhaps almost universally, considered 
to be the equivalent of "knowledge" . It Is in this sense of the term Informatio n 
that Information scientists and Information systems designers and users are 
concerned, and It is this senso of the term which Is the popularly understood 
moaning as well. 

In the Shannon-Weaver sense, the term Information Is Indeed quantifiable, 
measurable, and rigorously defined, but the theory Is of such limited applicability 
as to have little utility with regard to the design of information storage and 
retrieval systems. In the more general sense of Information being equivalent to 
knowledge, the term Is neither rigorously defined nor measurable nor quantifiable. 
Furthermore, it seems unlikely that such broad and vague conceptions can yield 
relationships that are meaningful and useful in any real sense. 

Thus, we have been concerned with the formulation and development of a theory of 
generalised Information flow which has wide and perhaps universal applicability 
(2)-(S). This theory loads to a model which allows rigorous mathematical treatment. 
One of the motivations for the development of this model Is the desire to establish 
analytical relationships which have as wide applicability as possible. These 
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analytical relationships can then be used to derive principles and Identify 
limitations on the flow and transfer of Information. The model may also be used to 
define fundamental quantities Involved In Information flow, Including Information 
Itself, In a quantitative, rigorous, measurable and repeatable way. In particular, 
the model can be used to generate specific measures to be used in evaluating the 
effectiveness and the utilization of information and for the comparison of 
information systems. 

Properties and Definition of Information 

To be of value it is clear that knowledge must at some time be put to use. That use 
Is to assist In making decisions; that Is, information used In order to make 
decisions . Indeed, the only resource that is available to a decision maker is 
Information, nothing more. Thus a definition of information has been proposed (2) 
which forms the basis for the model of a generalized information system which was 
mentioned above. This definition is: information ig. data of value in decision 
making . While this definition may delimit the total range of Interest in information 
in an intellectual sense, it does have virtually universal applicability with regard 
to any potential applications • It is further claimed that any more general definition 
of the term is not amenable to the quantification and analytic conceptualization 
necessary to the treatment of the study of information as a science. 

In the physical world, the only way in which a real parameter, say a mass or a 
velocity, can be measured is through an interaction of a system with its environ- 
ment. If this interaction does not exist then it is impossible to measure a 
parameter or even to establish its existence. The same statement can be made of 
information • Without some type of interaction with the environment it is not 
possible to measure information or to know of its existence. Such an interaction 
will result in some kind of physical change which can be measured or observed. 

A physical change or observable action will take place as a consequence of the 
operation of the decision-making apparatus. The decision maker will recommend 
courses of action which are in turn executed, yielding a set of observable actions. 
Since, as indicated above, the decision maker makes decisions solely on the basis 
of the information he receives, it is clear that this is a fundamental mechanism 
by means of which information is transformed into physical, observable actions. 

These remarks concerning the definition, measurement, and significance of informa- 
tion lead to the development of a model of information flow. This model can then 
be used to obtain fundamental analytical relationships which can be used for 
measuring the effectiveness of information. It is the purpose of this paper to 
propose such a theoretical approach. 
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V Evaluation of Retrieval Effeotiveneee 
The Generalized Information System 

Since the model of a generalized information system forms the basis for our approach^ 
it will be useful to describe this system in some detail in order to provide the 
necessary background to the present research. The generalized model is depicted 
schematically in Figure 1 • 



External Environment External Environment 




Figure 1. Generalized Information System 

The system is comprised of four essential functions. These include the 
Information Acquisition and Dissemination function (IAD) , the Decision Making 
function (DM), the Execution function (E), and the Transformation function (T). 

As has been pointed out, virtually all situations involving the flow of information 
can be described by this model. These situations would include the use of 
information by the research scientist or the development engineer, management of 
a large corporation, command and control of a military engagement, or such 
relatively straightforward and simple activities as the switching on or off of a 
j, thermostat-furnace system. 

I Personal decision-making problems are described by Figure 1. It is not even 

\ necessary or required that the decision-making process be a logical one. This 

| model is applicable when decisions are completely irrational, as may frequently 

be the case. Each function is seen to collect, store, operate, and disseminate. 

I In any realizable and operational system all the indicated functions must be 

| present, and they must be considered together In order to understand information 

flow or in order to establish principles, relationships, or guidelines for Information 
flow. Just as in the analysis of any system, suboptimization or consideration of 
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the functions independently may yield misleading or Incorrect results. 

In particular, the Decision Making (DM) function is a most important one and is 
established as the key consideration in the entire information flow process. The 
DM function represents any system component accepting an input from the 1AD, 
and providing an output to E. The DM may be an individual person, an organization, 
a man-machine system or simply a machine system. In all of these cases, the DM 
transforms information into courses of action which are in turn transformed into 
observable actions. The input to the DM is information, some of which may be 
stored or held in memory. The DM makes decisions on the basis of the information 
available at some particular time. The decisions are made individually, serially, 
sequentially, or in parallel and, of course, the decision-making process may be 
delayed. This may be the case when more information is necessary. But the 
decision maker is responsible for the generation of observable actions and will 
eventually make decisions that will lead to these actions. 

A very significant point concerns the closing of the feedback loop to the DM. In 
any system this loop must be closed to provide a basis for retaining or altering 
the courses of action disseminated and it is only on this basis of closure that the 
DM is able to refine or alter decisions intelligently. Such feedback is always 
present whether or not it is explicitly considered. Most of the presently designed 
so-called "information systems", however, do not consider feedback or closure 
to the decision maker in meaningful ways. The loop in the system is closed by 
the transformation of the observable actions generated by means of appropriate 
measuring devices, into data transmitted to the IAD which are in turn disseminated 
to the DM. This feedback provides the major interaction which the DM has with 
the total system and is the only way for the DM to probe and understand the system. 

Since information is supplied only so that a decision maker can make decisions of 
some kind it is most appropriate to analyze our model by starting with that particular 
function • 

Note that the decision maker collects three basic kinds o i information from the IAD. 
There is information on the particular activity under consideration (that is, data 
which have been obtained by transforming the observable actions resulting from 
operations of the Execution function). There is information on the external 
environment over which the decision maker has no control but may have knowledge. 
There is also other fundamental information which the decision maker may utilize. 
This latter category Includes reports, tables, mathematical and physical constants 
and appropriate relationships. The DM stores a data base in its memory. 

The DM operates as follows: it develops a predictive model which It believes will 
transform the information received into the appropriate observable actions. This 
model developed by the DM may be an accurate one or it may be incorrect. It may 
even be irrational or illogical. Nevertheless, it is the way in which the DM 
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believes that the only resource available It to, namely Information, will be 
transformed to observable actions. It is only after further Information Is available 
from transformations of these observable actions Into data that the DM "knows" 
whether Its model Is an accurate one or not. Secondly, the DM may alter 
recommended courses of action using the same predictive models, as further 
Information regarding the particular activities Involved or the resulting observable 
actions become available. Thirdly, the DM may develop new models as further 
Information becomes available either concerning the observable actions or the 
external environment. In this situation the original model was Incorrect or 
Inadequate regardless of the courses of action suggested. The DM disseminates 
courses of action (results of decisions) which are communicated to the E function. 

The E function Is responsible for transforming decisions Into observable actions. 

It collects courses of action from the DM and perturbations from the external 
environment. If It were not for these perturbations from the external environment, 

E would be essentially deterministic and would transform decisions In a predictable 
way Into observable actions*. It Is largely the action of the external environment 
which provides much Inherent uncertainty In the process of transforming Informa- 
tion Into observable actions. There Is also "Internal noise" which Introduces 
fluctuations Into the observables . The E function 9tores nothing other than Its 
structure or design and has no memory per se. It Is strictly a transforming process. 
It operates by transforming decisions Into observable actions. 

Observable actions or observables are quantities which are physical In nature. As 
the term Implies they are capable of being observed or measured * They are neither 
date nor Information, but are capable of being transformed Into data. This 
distinction Is an important one conceptually. Some examples of observable actions 
are the heat generated by a furnace, Vhe position of an aircraft, results of a 
scientific experiment, a new product developed by a manufacturing firm, or the 
movement of men and equipment In a military engagement. These are physical 
quantities, In themselves neither Information nor data. 

In order to become data or Information, observable actions must be transformed. 
This Is accomplished by the function which, for want of a better name, Is called 
the Transformation function. This function Is fundamentally a measuring device 
which transforms the physical observable actions to data. This distinction 
between data and observable actions, although perhaps obvious, Is not a trivial 
one. It Is an Important point to keep In mind. Data are transformations of the 
observable actions. 



*Thls assumes that the E function Is known by the DM. This knowledge Is 
generally obtained only by successive Iterations of decisions courses of 
action, execution,' generation of observable action, transformation to data, 
and then transmission to the DM, further decisions, etc. 
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The last function remaining to be discussed in this model is the IAD function or 
perhaps, more accurately, the Information Acquisition, Storage , and Dissemination 
function. The IAD is frequently referred to as an "information system" . Although 
it is of course a system it is in fact an open-loop system and, as we have shown, 
only a part of a much more extensive closed-loop system. As has been indicated 
it is important to consider the total closed system as shown In Figure 1 . The IAD 
is only a part of this system and must be treated accordingly. 

In order to measure or even discuss the effectiveness and utilization of information 
and of information systems it is necessary to consider the total system as 
Indicated in Figure 1 • In other words the total process encompassing the decision 
making and execution into observables must be of concern In the determination of 
effectiveness and value. It is clearly impossible to determine any measure of 
utility without considering the total process. To consider the IAD by itself 
independently of the decision making, as is generally done, can not give meaning- 
ful results. Some understanding then of the class of users (decision makers) and 
the resulting observables must exist to develop significant measures. 

The IAD function collects data from thrcu different fundamental sources. It 
collects data on the particular activity under consideration (that is, data which 
have been obtained by transforming the observable actions produced). Data from 
the external environment are also collected. The external environment includes 
all the factors over which the decision maker has no direct control. Finally, the 
IAD collects basic data such as references, tables, reports, textbooks, relation- 
ships, etc. The IAD disseminates data for the use of the decision maker. These 
data, if used, are information within the context of our definition of information. 
This distinction between data and information is of fundamental importance here. 

The model of a generalized information system has now been described in summary 
detail. All of the functions indicated must be present for any meaningful analysis 
of information flow. Information now can be defined and analyzed in generality 
by the use of this model. It is further contended that only in such a model does 
information really have any significance. General discussion of the storage 
of knowledge or data without reference to the potential uses leads nowhere, or 
provides minimal fruitfulness. 

General Relationships Involving Information 

The general definition of information which has been proposed Is "Information is 
data of value in decision making" . Data on the other hand are transformations of 
observable actions. They result from the measurements which have been made 
onphysical quantities. Information Is thus inherently involved in the decision 
making process and inherently involves transformation into observable actions of 
some kind. 
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Note that information must be a relative quantity depending upon the decision 
maker, the total history of the decision maker, and the particular environment 
involved. In this sense it is different from the physical universe where a 
physical quantity is absolute and independent of the particular situation. That 
is, a mass of 20 grams generally remains a mass of 20 grams independent of the 
position or momentum of the mass. 

It should not be surprising that information is to be considered relative. For 
example, a sophisticated research paper in some technical field may have a great 
amount of information for a scientist who is well versed in that field but no 
information to, say, a housewife who would be completely unable to make use of 
that paper. The background of the decision maker must necessarily be important 
inasmuch as two decision makers given the same document may act differently, 
thus generating different observables. 

We may represent the relationship between the information and the observables as 

X x T ■ 0 

where X is the information, transduced (T) by the DM and the E leading to the 
observable actions, 0 .. More precisely, one might more properly discuss the 
changes in the observables 

AT x T -A 0 



Generally, X and 0 each would consist of a number of components and might be 
thought of as being represented by vectors. 

Note that there will be a smallest amount of information that will cause the DM to 
change a decision which in turn produces a change in the observables. That is, 
it is the smallest amount of information which will cause the decision maker to 
change a course of action. As we have noted, this smallest amount of information 
is relative to any particular situation. This smallest amount of information is 
fundamental in the relationships developed. This fundamental unit we call an 
informon . It is convenient to measure information content in units of informons. 



Discussion 

It is just this value of the information in informons which is directly related to the 
significance of the information received. As we have shown, the amount of 
information In a message is a direct function of the change in observables caused 
by that information. However, the change in the observables may not by itself 
be of direct significance • It is the change in the observables as a function of the 
minimum change that would be obtained from a change in a decision that is a 
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quantity of considerable significance. This is closely related then to the amount 
of information measured in units of informs. 

Note that we do not specify what the various functional relationships actually may 
be. These must, as we have shown, be dependent on the decision making process 
which is involved. Thus, the relationships will differ from situation to situation 
and even from time to time. Or to put It in another way, Information and Its value, 
its effectiveness, its measurement, Its utility, and other quantitative measures 
must all be relative and dependent on the particular situation which Is being 
considered. As we have indicated previously, a particular scientific paper would 
contain different amounts of Information for scientists with different backgrounds 
and it would clearly contain a different amount of information for a military 
commander or a statesman. Each of these individuals would make different 
decisions generating different observables from the same document. Of course, 
an individual might make no decision at all if the document is of no Interest to 
him or if he is unable to understand it. 

Once it becomes clear that Information must be treated In a relative way, dependent 
on the particular situation which Is considered, then many conceptual and specific 
relationships also develop. Naturally it is difficult for us to accept this type of a 
situation Inasmuch as we are part of a physical universe where the measurement 
of a physical quantity is generally absolute and independent of the situation 
considered. Thus specific measure's for the value and effectiveness of information 
and information systems must consider in detail the ultimate objectives to which 
the information is to be put and the types of decision makers who will use the 
Information. Currently, at The Ohio State University we are pursuing research 
to establish more definitively some of the relationships that must exist among 
these various activities and parameters for various classes of users or decision 
makers. While it is necessary to know the decision making rules In detail In 
order to develop specific relationships, it Is possible to develop general, useful 
relationships for different classes or types of decision makers. General relation- 
ships of this type are currently being sought In our research. 

Conclusion 

In an attempt to establish some fundamental relationships that describe the 
principles of Information flow and transfer and which define its limitations some 
earlier papers have developed a generalized theory of Information flow. It Is 
believed that this theory has wide and perhaps universal applicability. This 
theory which has been previously described leads to a model which allows 
rigorous mathematical treatment. In particular, It defines Information as data of 
value in decision making and develops mathematical techniques for measuring 
and quantifying rigorously information In a general sense. 
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This paper uses that theory to suggest a procedure for measuring the effectiveness 
and utilization of information and information systems by relating the information to 
the change in observables which is attributed to the information in the document 
or message. It is indicated that a fundamental unit called an informon, which is 
the minimum amount of information required for a decision maker to change a 
decision, plays an important role in these determinations. If the Information Is 
measured in terms of informons, then It is closely related to the effectiveness or 
value of the information . 

In this paper it Is pointed out that further research is taking place to establish 
general relationships which may be used to develop criteria and quantitative 
measures of effectiveness for information and Information systems. While detailed 
measures will depend specifically on the user or decision maker and the types of 
decisions made, It Is possible to develop general relationships which are 
applicable to classes of decision makers. 
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ON THE NOTION OF RELEVANCE IN IN FORMAT ION RETRIEVAL* 

C.V* Negol$g 

National Council for Scltnca and Technology, Bucureatl, Romania 

SYNOPSIS 

The paper preaenta a new point of view about the "relevance 19 
concept* The logic of information retrieval la analyied (two- 
valued, many -valued and infinite-valued logic) and the truth- 
value la Interpreted aa the relevance value* The logical 
ayeteai which aeeaa to be moat appropriate to deacrlbe the 
procear of Information retrieval la the contlnuoue logic 
auggeated by McNoughton* In thla caee the answer of the In- 
formation retrieval ayatem la Interpreted se a fusty eet. 

INTRODUCTION 

Over the laat twenty years, a number of attempts to furnish a theoretical founda- 
tion for the retrieval proceaa atand out, but little accepted theory exlsta aa yet. 

One of the most eerloua difficulties In the theory of Information retrieval le the 
lack of adequate explications for the notion "relevance 99 * Thla term la used so widely 
and ao broadly today that It tenda to connote fusty thinking* 

Some experts pointed out that the relevance was not the subject of detailed theo- 
retical investigation. Donald Hillman 9 a opinion la only one of many: 

"In recent theoretical discussion of the structure of Information storage 
and retrieval systems, one major problem has received little attention* 

This Is the problem of elucidating the concept of relevance, or connecti- 
vity. The importance of having a clear notion of thla concept Is hardly 
a matter of doubt, for retrieval outputs consisting of citations to docu- 
ments which are either unrelated to each other or Irrelevant to the 
prescription are totally unserviceable for the user. It Is therefore 
imperative that the concept of relevance be adequately defined and a 
suitable place found for It In the theory of Information storage and 
retrieval systems ." 1 

This is not to say that scientists were not very much aware of the importance of 
relevance concept and that a fair amount of discussion vaa not devoted to the subject. 
Rather, It suggests that, historically, considerations were not directed toward and 



'Editor's note: This paper was retyped from a somewhat unclear originals and due to 
delays in the mails we were unable to get the author's corrections to the manuscript 
before going to press. We apologise to Mr. Negoita and our readers for mistakes or 
omissions in the paper * 
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did not result in a theoretical, or even less, a mathematical framework for the in- 
vestigation of the relevance concept*). 

There is no universal agreement at present on what the primary definition of rele- 
vance should be. Some specialists go so far as to express doubt that the problem admits 
a definitive solution at all. Obviously, this is a topic in which no one can yet claim 
the final word. 

With the opinion that we can get really significant insight into the structure of 
information retrieval systems only by applying powerful algebraic methods, this paper 
is Intended as a contribution toward what might be a general theory of information 
retrieval, touching upon some features that seem to have had little attention. The 
research worker will find here some fresh points of departure for future research. 

INDEXING 

Before proceeding to a technical definition of indexing lat ua review briefly the 
intuitive ideas involved. 

The retrieval process can In general be considered as a matching process between 
a search request and a set of documents* In so-called descriptor systems, the re- 
trieval of items of information in response to a request involves comparing an item 
specification derived from the request, with the descriptions of the actual items of 
information. The preparation of these descriptions la normally raferred to as index- 
ing: the assignment of a set of tags or descriptors to provide an indication of the 
content and a means of retrieval. 

An information retrieval system is then one which compares the specification of 
required items with the descriptions of stored items and retrieves, or Hats, all the 
items which correspond in some defined way to that specification. The content identi- 
fiers are used during the file search as principal criteria for discriminating between 
retrieved and non-retrleved items. Finally, those items ara identified aa relevant to 
the request whose key information is considered to be sufficiently similar to the 
request keys. 

Thus, a relevance sssesaent becomes a value judgement on a retrieved item. 

We are now ready to formalise the indexing process aa follows: 

If d is a descriptor and x an information item, consider the proposition "the 
item x has the descriptor d". It is symbolised in the calculus of propositions by the 
single letter p. Rut in the calculus of functions we refer to the fact that the pre- 
position has an inner structure and we separate the subject "item" about whom something 
is said, from what la said about him, namely, the property of having the descriptor d. 



*Serloua attempts at preliminary explications of ths notion in question have been how- 
ever made. In the work of Goffman* - substantial contribution, though rathar raatrlc- 
ted in axtent, to the achievement of an adequate explication - relevance la a relation- 
ship between the search requaat and the set of documents, l.a. a measure of information 
conveyed by a document relative to a request. 
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As the symbol of this relation, we employ that of the mathematical function and sym- 
bolize the propoaltlon In the form d(x).^ 

The argument sign x corresponds to the subject about which we speak; the function 
sign d corresponds to the property holding for It. The symbol d(x) indicates the In- 
ner structure of the propoaltlon. 

Since d and x are varlablea 9 the expression d(x) can be given various meanings by 
suitable specialization of these variables. If only d is specialized, the expression 
d(x) la not yet a proposition. The sign d Is called a propositional function (predi- 
cate); the combination d(x) Is called a functional. If we wish to Indicate the 
variable In a propositional function, we write d(ft); the circumflex distinguishes 
this expression from the functional d(x). The expression d($) therefore, means the 
same as d. 

The object correlate of the propoaitional function, the property denoted by It, 
la called a situational function. For instance, the property, or situational function, 
of redness la denoted by the predicate, or propositional function "red". 

There are two ways of constructing a proposition from a propositional function, or 
from a functional. The first has been mentioned. We substitute a constant for the 
variable x, that Is, we go from d(x) to d(x]). This la the method of specialization. 
The second procedure for constructing a proposition from a propositional function la 
the method of quantification, which employs the all quantifier ^ x and the existential 
quantlflerS x. 

The aim of an Indexing system la to make propositions that Inform us about the 
content of the documents. The proof of the truth Is baaed on experience and observa- 
tion, sources of knowledge that play no part In logic. 

THE LOGIC OF INFORMATION RETRIEVAL 
Two-valued loalc 

In a two-valued propositional calculus we write 'WV for propositional varia- 
bles, for negation, "A" for conjunction, " v M for alternation, M => " for 

Implication, M s " for equivalence. 

Well-formed expressions built up from the preceding symbols will be called 
formulas. 

Formulas must just have two "truth-values"} "true" and "false". They ars true or 
false; they may not be true and false together. Hence It Is Intuitively clsar that v 
there are 2 n combinations of values for a formula mentioning n different propositional 
variables. 

The given character la tlzat ions of the propositional operations ars axprsassd In 
following "truth tables", In which the truth-value v(p) of a proposition p, which Is 
either truth or falsehood, la denoted, respectively, by "1" and "0". 



p 


— P A 


10 V 


10 D 


10 a 


1 0 


1 

0 


0 1 
1 0 


10 1 
0 0 0 


i i i 

10 0 


10 1 
11 0 


1 0 
0 1 



Using these tables we may dstsrmlne the value of any formula for all possible values 
of the variables. 
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Let us consider Che formula pAq. Clearly we have 4 combinations of values for p and 

qs 

for p 110 0 

for q #•••••••••••••!••»»• 1010 

for pAq 1 0 0 0 

If Che two-valued propositional calculus is accepted as the underlying logic of an 
information retrieval process, the algebra ( Ip) » v« Ai/v » u, t ) of propositions, 
where t is the tautology and u ia the contradiction, may model the storage and re- 
trieval of information. This algebra is a Boolean algebra defined as a system 
(B, *, + , — ,0,1) for which the following axioms holds 

1. - The binary operations . and + are commutative 

If a,b £ B, then a.b*b.& and « + b» b + a 

2. - Each binary operation is distributive over the other 

If a,b ( B, then a,(b + c) ■ a.b + a.c 

a + (b.e) • (i + b),(a+c) 

3. - There exist in the set B two distinct identity elements 

0 and 1 relative to the binary operations • and + 
respectively. 

A.- The unary operation - is such that for every element 
• a in set B there exists an element S in B such that 

a.I ■ 1 
a + i ■ 0 

Thus, in this model*) (queries being represented by fcvAulas and the file of 
u^cuments to be interrogated by a finite set) there is a unique subset of the file 
wh’ch corresponds to each formula. For each member in the file the formula takes 
on *. truth-value of "1" or "0", depending upon whether the document is or is not rele- 
vant to the query. Hence the answer consists of all members in the file which take 
on i vnlue of one, i.e., all members for which the formula is aseertable. 

If we now consider the relationship between propositions and truth-values, in the 
light of the above explanation, then the truth-value may be identified with the rele- 
vance value , The existence of the truth-value establishes the precise nature of the 
connection, *'liminatlng the confusion. 



* An equally woli -known interpretation of the same formal system is supplied by the 
so-called calculus of classes** If a propositional function d(2) is given, all 
arguments x that satisfy d(2) can be incorporated in one class, the class D. Every 
propositional function thus defines a class. Vice versa, every class can be re- 
garded as defined by a propositional function. The arguments x for which d(x) is 
true are called elements of the class D. For the expression "x is a member of D" 
we write xCD. 

Let us consider the algebra fP(X), f|,U» • ,0,X) of subsets D € P(X) of a 
set X of documents, which is als'> a Boolean algebra, if D is interpreted as the set 
of all documents indexed under term b th*ns 
D + E 
D . E 
0 
1 



D n E the set of all documents indexed under terms D and E 

DUE the set of all documents Indexed under terms D or E 

0 the sat of no documents 

X the sat of all documents 



A50 




N 



f 



i 

• Relevance in Information Retrieval 

\ If the two-valued logic has been accepted as the underlying logic of an Information 

\ retrieval process , the retrieval system may be defined formally as a relation on abs- 

tract sets Q and A representing, respectively, the set of inputs (search requests) 

! and the set of outputs (answers). 

| Scq x A 

\ y e regard a system as a structure which receives Inputs, performs some function, and 

I produces an output* Since the Inputs can be considered as causes and the outputs as 

\ effects S Is a .functional system 

j S : Q-*t 

Defining the search function f of the system by the mapping 
! f : {q) x X 

the answer A of the system is the graph of this function?) 

' A -{(x,f q (x)) J xCX } 

Many- valued logic - > 

I In the two-valued logic, let jp] be the set of propositions, L2 the set |0,1J 

! and V2 a function defined by the mapping 

| v 2 ! (p) L 2 

The value V2(p) is called the two-valued truth-value of the proposition p 

(1 If the proposition p Is true 
V 2^P) " [0 if the proposition p is false 

Such a function is a homomorphism of the algebra of propositions onto the two-element 
i Boolean algebra of truth-values. 

In a similar way, in the three-valued logic (of Lukasiewicz) 

I V3: Jp) -* L3 

^ J |*1 If proposition p Is true 

v 3(p) ■ 1 1/2 If proposition p is indeterminate 

10 If proposition p is false 



I * Sincc f q (x) - 1 if x ( A 

| f q (x) - 0 if x 4 A 

i the retrieval system can be defined as 

I S £ Q x P(X) 

f. S 1 Q -» P(X) 

■ In other word., the retrieval syatra is the eyatea (X,Q,S) where S is the retrieval 

| function. The set of docu.enta S(q) is said to be retrieved by the request qcQ. 

j That is, the mapping S maps each request qC Q into some subset of the document 

set X. 

i 
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In the four-valued logic (of Lukasiewicz) 
v«: (p| 



v« (p) 



[1 If proposition p Is true 

2/3 If proposition p Is possibly true 

^ 1/3 If proposition p is possibly false 

iO If proposition p Is fals 



In the many-valued logic (of Lukasiewicz) 

v n* {Pj ** 

* 



where 



i 0 ' fci* bv 



n=2. i] 

n-1 ’ 1 J 



Some values from Lq correspond to r levels of falsehood » other values correspond to 
s levels of Indetermlnatlon and other values correapond to t*n-r-s levels of truth • 

There Is an algebra of many-valued logic that could play the role of Boolean 
algebra In the case of classical two-valued logic* The many-valued Luka a 1 evict algc 
bras were Introduced by Nolsll as models for Lukasiewicz 1 many-valued propositional 

calculus* 5 

The Mol8ll algebra Is a lattice N which satisfies the following axioms: 

1*- M is a distributive lattice with flrat element 0 and last element 1 
2*- M has an Involutlve duality named negation 
N(aAb) - Ha V Nb 
N(aVb) - Ha A Mb 

NNa * a 

3.- M has n-1 chryslpplen endomorphlsma a-^C^a 



MH 



a 


0 


1_ 

n-1 


2 

n-1 * * * 


n-3 

n-1 


n-2 

n-1 


1 


Cl* 


0 


0 


0 • • • 


0 


0 


1 


0 2 a 


0 


0 


0 • • • 


0 


1 


1 


C n-1» 


0 


i 


1 . . . 


1 


1 


I* 


^0 ■ 


0 and 


i 


- 1 









Gia a H C ja ■ 0 

V N Gi* - 1 

4.- The elements O'ja form a linear ordered lattice 
c • • e^n-l* 
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5. - There is the relation 

G* Cj« * Gj« f°r any i,j 

andCjNa ■ N Gja where j *V*(i) 

6. - If G^a - G^b (i«l,.,.,n-l) then a-b 

Sicoe^ proved that a Holsll algebra la a system (M, 1, N, u *n-l’^^ 

where 1 la not postulated as last element, verifying the following axioms: 

1. - aAN(NaANb) - a 

2. - aAN(NbANc) - N(N(c A a )A N(b/\a)) 

3. - N( G ^Na A Cj a) - 1 , j (1) 

4. - (Tx (Tj(aAb) - ffj® > for any i,j 

5. - CT^a AG R Ha - NGja , 1 *k , J-tf(l) 

6. - If G^a ■ C*b (UUn-1), thenipa*b 

and that a many-valued Lukasiewicz algebra with if (1) ■ n-1 (mod n) Is a Kleene 
Algebra. 

If the many-valued logic has been accepted as the logic of an Information retrieval 
process the search function g of the retrieval aystem ScQ x A can be defined by the 
mapping 



i.e., the proposition (or the formula) should be permitted to take on any value on 
|o,l] • All real numbers t such that Oftfl shall constitute the aet of truth-valuea 



operatora (thoae of the two-valued logic). Formulae In L c are then defined by the 
following aet of rules: 

1. - every propositional variable la a formula. 

2. - If F la a formula then*vF la a formula, 

3. - If F and F* are formulae, then FA F* ,FVF* ,F »F* , Fi F’ are formulae. 




and the answer, by the aet 




The continuous loalc 

In a continuous (inf in it e-v slued) logic 




which the formulas are permitted to take. This logical system, which we shall call 
L c , may be constructed as follows: ve shall admit a denumerable number of propoal- 
tlonal variables denoted M Pi M » M P2 M » ••• t ,, p n M together with a number of propoaltlonal 



Pi* 1 » ,I P2 II » ••• t M p n M together with a number of propoaltlonal 



■ t 



- t 
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We ahall denote 



v(p) 

v(F) 



■ V Evaluation of Retrieval Effeativenese 

With every formula F there Is associated a truth function {(t^ t n ) vhere t- 

are the truth-values of the propositional variables p* The value of the truth* 

j function ^ Is said to be the truth-value of the formula F. Thun, the truth-value of a 

formula Is determined solely on the basis of the truth-values of Its propositional 
j variables . 

We shall now define* the truth function of the elementary formulas by generalising 
the two-valued case. Thus: 

v(~p) - l- t 

v(poq) ■ Min (1 - t + u , 1) 
v(pvq) “ Max (t,u) 

; v(pAq) ■ Min (t,u) 

The propositional operators M v M end "a" «*y he defined! 

i p v q •• (pDq)»q 

; paq •• ^(-^pv-q) 

If formula F contains the propositional variables 

then we shall write F(pi Pn> and 

. v(F(p 1# ...,p n )) ■ H* (tj t n ) 

| ft 

Suppose given a function '{(tit. . . # t n ) defined by 0(ti^l» l-l,...,n. McNoughton 

has shorn that for a function $ (t 1# . .. ,t Q ) to correspond to a formula F(p A Pn>t 

such thatf • V 9 It la necessary and sufficient that y satisfies the following 
conditions: 

1, - {is uni-valued, continuous and 

0 ^ { (ti, • • . 9 t n ) ^ 1 If O^tj ^ 1 9 1*1, ••• 9 n 

2. - there exist a finite set of polynomials tV* 

^j-bj+mijti*f...4m n jt n where b end a are Integers, 

such that for any (t^...,^), 0 £ t^ < 1,1*1 n 

j exists a J, 1 4 J * A such that { -1 j(t A t n ) 

i 

If the continuous logic has been accepted as the underlying logic of an Information 
retrieval process, In the retrieval system SCQ x A the search function h la derived 
1 by the mapping 

h l ^qj x X-a 10 9 1] 

This function maps the set X in the unit interval. Then the answer A la a eet of 
ordered pairs. 

A - ^(x,h^(*))\* x xj 

where h q (x) le termed the grade of membership of x in A. Thus, the answer of a re- 
trieval system la a fussy set, i.e., a class of objects In which there Is no sharp 

* Another continuous logical system (the probability logic L p of Relchanbach) was 
sugg sated by Coffman?. In lp* 

v (~p) - l-t 

v (pmq) ■ 1-t+tu 

v (pvq) ■ t-Hi-tu 

v (pAq) ■ tu 
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boundary between thoae objects that belong to 



Relevanoe 

the class and 



in Information Retrieval 

those that do not.3»^ 



Let A and B be anavera in a apace X t with the grades of membership of x in A and 
B denoted by h A (x) and hg(x), reapectively. Then for all x in X: 



C 

D 

E 



A ■ B *-• h A (x) 
AcBh h A (x) 
A(JBm hc(x) 
A OB «-#h D (x) 
X h^x) 



- h B (x) 

* ^(x) 

- Max (h A (x) p h B (x)] 

- Min (h A (x),h B (x)] 

• 1 - h A (x) 



If A P B and D ■ A OB are bounded convex answers then l-Suphp(x) is the highest degree 
of separation of A and B that can be achieved with a hyperplane. 



The family T of answers in X la a fussy topologylO, which satisfies the following 
conditions! 

McT 

if A P B€ T then AnBCT 
if A t CT then U A* C T 



The pair (X P T) la a fussy topological space. 
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Session Five - Discussions 

EVALUATION OF RETRIEVAL EFFECTIVENESS 

Chairman: Dr. A . Coakx (Belgim) 



NR . T.M. AITCHISON (VKj: Tbs necessity for our investigation arose when ve were de- 

veloping the Science Abstracts system into the integrated information service that 
became INSPEC. There was a need to find e subject indexing system which would be 
suitable for retrospective searching of the date base and SDI t as well as for the 
printed indexes to the abstracts journals. 

We decided that there were essentially five possible index languages we night use: 
Title; title plus the abstract; subject heading in modified line that we use in the 
printed indexes in our abstract journals; uncontrolled or free indexing; thesaurus- 
based controlled indexing. 



Obviously , it would have been ideal to be able to use the title, or the title plus 
abstract t since both of these were bound to be in the data base when we were producing 
Journals. Next best would be the subject heading, as long as we were using that 
system. 

The other languages, free-indexing and controlled languege 9 were less atrectlve since 
these would have to be added to the data base. We had experience with both of them 
since we had been using a thesaurus-baaed controlled language in our experimental SDI 
service and in the SDI service in electronics before we integrated it with our main system. 

On that basis 9 ve sat out on the lengthy process of carrying out an evaluation of 
the ratrlsvsl performance. The beat one was controlled language. The next bast vaa 
free indexing. Then tha printed index system. Then the title and title plus abstract. 

In other words, the ranking was exactly the inverse of what ve would have liked. 

We then considered the question of the coat effectiveness and practicality. We 
ruled out the last three 9 the cheapest ones * the printed index system and the title 
end title plus abstract. That left us with the free indexing and the controlled lan- 
guage. At this stage ve carried out failure analysis in recall with these two lan- 
guages. The free-indexing language had a lower performance 9 probably dua to our not 
having included in the question formulations all the possibilities for stating tha 
different concepts that appeared in the questions. 

The answer of course was that the thesauri we used were not based on the indexing 
of our documents and therefore could not be expected to provide what ve required, 
whereas the controlled language thesaurus which had been developed was baaed entirely 
on the document file we were using. So ve came to the conclusion that if, for a 
variety of reasons, a controlled language would be impractical, ve could hope to ob- 
tain the benefits of a controlled language by using a natural-language or free-index- 
ing thesaurus. Thus ve would consider each new concept as it arose for adding to the 
thesaurus. But the important thing was that the free indexing would remain unchanged 
on our data base and this would be available for searching in natural language. This 
would avoid having us make any decisions in the thesaurus that would affact the data 
base, and would be there forever. 
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Our evaluation was done specifically for INSPEC, in our own particular environment, 
and has no pretensions to general applicability. However, I wonder whether the conclu- 
sions, such as they are, may not be worth considering in relation to other subject 
fields. 

MR* U* BLOCH (Israel): I didn’t know about Mr. Aitchison's work at all until last 

night. In a peculiar way we have used complementary techniques in order to find out 
the same thing. We found in an information center which wants to use ready-made bases, 
usually you have bases that are either based on abstracts, or indexes, or index terms 
and abstracts. In this last case, it is fairly pertinent to find out whether one of 
the two, the indexing terms or the abstract alone, will give sufficient retrieval, or 
whether you need both. 

Mr. Aitchison's work is based on relevance assessments for retrieved items. Now, 
we all know that this expression of relevance is not a clear-cut affair of either yes or 
no. We all know that something can be relevant if you haven 9 t seen another paper on 
it, but by the time you have seen three papers, the first paper would not be relevant 
any more because you already know about it. For that reason we felt that we would 
like to develop a technique that is not dependent on assessments, but to do something 
that is much more objective. 

We tried to minimize subjective evaluation by a two-step process. As far as the 
indexing terms are concerned there is really no difficulty - you have the terms. But 
when you want to find out what is in an abstract, you cannot simply go and say: let’s 
compare the abstracting to the indexing terms, because then vour own subjectivity will 
automatically say: well you can say this term is equivalent to the other. And we 
found very quickly that this is something highly non-objective. We decided that the 
extraction of terms from the abstracts hss to be done In a way that will ensure that 
it will be consistent. Once we have developed something consistent, we no longer say 
it is subjective, because we find the sa®e abstracts repeatedly give the same kinds 
of indexing terms. 

After the first step of finding a list of possible terms, we invite a subject spe- 
cialist to compare the list of terms that we have culled from the abstracts with the 
terms in the index. We permit him to make only three decisions: Do you find synonymous 
terms in the index and abstract? Do you find that one of the two has a more specific 
term than the other? Do you find that the term appears only in one surrogate and net 
in the other? This is about as far as we can go to ensure that it will be as objective 
as possible. 

From there on, it iB really a matter of counting. We asRumc that the greater the 
number of entry points the greater the effectiveness. In principle I think this right. 
If you go on you can of course make as many points as you want; it becomes a very noisy 
affair, and your precision will suffer badly. No doubt in theory this is true, but as 
long as you take only indexing terms from abstracts or indexing terms that have been 
prepared, you may assume that the terms you find in these are relevant to the subject 
of the item and you will not create too much noise. 

Another remark I have heard ip that all we have actually done by culling index 
terms from the abstract which we afterwards compare with the index terms prepared 
originally,, is that we have prepared a second list of indexing terms; and as we all 
know, index terms are highly inconsistent. We feel that since we stressed greatly 
that the list culled from the abstract must be consistent between two people, this 
randomness has been eliminated, and we do not think that this is just another 
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list but something more reliable. We have in fact found that after some time there is 
fairly good consistency between two or three indexers from the point of view of culling 
terms from an abstract. When we did statistical tests to see whether there is proba- 
bility of chance in this, the probability was fairly low; about 1 in a hundred, and 
less, 

Mff, /?, NJERPPE (Sweden): (Presentation of Gludhowiaz paper). The goal of the SDI system 
of the Royal Institute of Technology, Stockholm, is, and I quote: "to ensure an exten- 
sive ready access to documentation in subject fields of interest for the industrial 
community and for the educational and research program at the universities and research 
institutes in Sweden," This is quite an ambitious goal, and we have tried to fulfill it. 

In 1968, our system developed very slowly until we prepared our first information 
brochure and held our first information seminars. We have since had a steady rise in 
the number of inputs, during the whole period that the service. was free. When we 
started asking for money, there were a large number of cancellations* Today we ar* 
going up again, and the number of cancelled profiles is decreasing. 

In profiling, we use the usual Boolean logic. We use AND, OR and NOT operators 
and an operator which we might call SELECT, The SELECT operator uses a number and a 
letter. For example, 2B means that any 2 from B will do. Our Boolean operators 
operate on groups of terns instead of single terms and we have both left and right 
frontations. Our search terms can be of any sire. In fact, in our profile we have 
12 different types of terms. You can ask for all authors, or for a publisher, or for 
affiliation, or for the name of a Journal, or for key words or classification and/or 
chapter heading* All in combination as you like. One of the big problems in maintain- 
ing these profiles is the interaction between the user and the documentalist, I want 
to stress that this interaction is something dynamic; a profile is not made up once 
and for all, but must be revised regularly* 

Relevance judgement or precision judgement is not something static which can be 
given in percentages. We process totally 14 different data bases with over 1 million 
references per year. For those data bases which have more structure, you have to have 
different versions of the same profile, and ideally you should have one query profile 
for each base because each base is unique in content and structure. This is not 
practical, so we take the largest structured data bases and divide them. For example, 
we have a separate profile running on INSPEC, This profile first of all sub-divides 
INSPEC tapes into three parts: physics, electrotechnology and computer control. We 
then search that part which is pertinent to the question, using a normal profile and 
also using the classification or chapter headings which are there, in connection with 
natural text. This means that having first used all the natural test possibilities 
we broaden the scope of the query by combining the categories and the natural text 
terms. In this way we get more precision without losing recall, within the context 
of the data base. 

Regarding profile construction and updating, we introduced statistical tools; Cor 
example, statistics on the number of hits for each search profile per data base run. 

If someone continually receives 1,000 references per run, something is wrong, because 
this is mostly not what the customer wants* We have some customers who are quite 
satisfied to be getting nothing, because this means that nobody else is engaged in 
same work* On the other hand, we have customers wtio are very eager to receive even 
200, 300 references every week, although the precision might be as low as 10%; they 
8 till want it exactly this way. We have tried to explain to them that there are ways of 
doing it better, but they want it as it is* And we don't interfere with their wishes. 
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V Evaluation of Retrieval Effectiveness 

We also produce a diagnosis for each output. For all profiles vhich give an out- 
put of more than 20 references per run, we produce a diagnosis as to which combinations 
of terms have resulted in hits. 

We also have a thesaurus of terms which the customer uses. When queries are sub- 
mitted to us | the first thing we do after having a dialogue with the customer and 
checking our interpretation of his query, is to select those data baaes which should 
be used for the customer. No customer uses all the 14 data baaes which are available 
to him. There is a table at the end of the paper which shows that of 800 profiles, 

62 use only one data base, 58 use 2, 63 use 3, 201 use 4, 187 use 5, 32 use 6, and 4 
use 7. Therefore, the performance for this kind of system cannot be given in a simple 
relevance assessment or precision assessment, because naturally the effectiveness of 
the different data bases will vary. The first thing which has to be done is to look 
at the different data bases, as in Cases 1 to 15 which are included in the paper. In 
these Cases, the relevance as measured by the customer varies from close to 100% to 
about 20%. And even when the customer receives only 20%, he may still be satisfied. 

So even a user evaluation in terms of precision does not actually state anything about 
the effectiveness of the system. 

In the beginning when the service was free, the only thing the customer had to do 
was to give us an evaluation of the results. These results are included in the paper; 
out of 36,000 references, 9,000 were of immediate interest, 2,459 were of iwiediate 
interest. but the customer had already read them, and 11,000 were of interest but not 
for immediate use. Sunning up these three figures comes to something like 60% preci- 
sion overall, which is quite good. It is important to stress that the interest of a 
customer is not static, but evolves continually. As soon as the customer receives 
information which is relevant to him, and which has given him something new to consider, 
it is quite natural that his interest changes. 

Finally, the effectiveness of search from the precision point of view is very de- 
pendent on the activeness of the user. We have some users who are very active and who 
receive very good results, whereas we have customers who in the beginning submit a 
question and then do not follow it up, which makes it very hard for us to know how 
the system works for this particular person. We have noticed that applied research 
workers are more inclined to want small packages of information. We have had customers 
telling us: M I receive too much relevant information; I can Y t swallow it, I don*t want 
this service." On the other hand, most people in basic research want a lot of output. 
They can afford the time to read more. 

PROF' P. PER LAND (USA) : I value this opportunity to talk about an approach to communi- 
cation that is not often heard in the information science and retrieval field. We move 
with my presentation from bounded, finite systems, ISR, to finite interactive systems 
in a dynamic environment. 

I assume that there are three components of information science. The first of 
these is the manual and technical operation of libraries, the usual library routines. 
The second is mechanized and automated ISR, information storage and retrieval. The 
third component which is emerging is a theoretical discipline for advancing the under- 
standing of the information function in all the disciplines. As a result, I will not 
argue with those who claim that information science is a discipline. It is indeed a 
discipline. It is based upon theoretical considerations, and rigorous research design 
is used with advanced knowledge in this field. 

However, despite library and ISR orientation, there are considerations which have 
caused us to raise questions about this communicative infrastructure called library 
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and information science. First , it seems to me that it is difficult, if not impossible, 
to evaluate a system from the inside and conversely, goals for a system cannot be set 
within the system. It seems that no evidence has appeared yet that either the library 
or information processing center is not a system. In any event, I myself am often 
frightened at what we are doing to people in training them to be librarians and infor- 
mation scientists. We call it a profession, but we emphasize a service orientation, 
a service distribution and an information supply function, precisely those aspects 
which in any other area of human endeavor would be called an occupation. Of course, 
librarians have human relations training. But what comes out most often is public re- 
lations, salesmanship, and that kind of thing, rather than communication leadership. 

The second consideration has to do with the relationship between a discipline and 
a profession. It seems to me that the pursuit of knowledge by means of a discipline 
is purposeful, perhaps even moral. It is certainly not concerned with teleology; it 
is not concerned with the social usefulness of the discipline. It seems to me that 
it makes little sense to question the validity of atomic research or computer research 
on the basis of the fact that these instruments may be used to destroy man. This is 
irrelevant to the concerns of a discipline. It is however a primary concern of a pro- 
fession whether these instruments will be used for social purposes - for the better- 
ment of man and for the good of our community. 

The position that we have taken is to study the flow of information in society 
rather than the route of ISR, whose main concern is the creation and regeneration of 
knowledge in the disciplines. Users' studies, as we presently know them, are largely 
concerned with the scientific use of information: how do researchers use information 
to create new knowledge, rather than to solve social problems. The simulation that 
we have developed is designed to study the use of information in political and commu- 
nity decision-making under conditions that we feel are isomorphic with reality. 

What we call library and information science is based upon the usual homomorphic 
and isomorphic transformation. On the other hand, the structure of the socio-economic 
system is based upon social power, i.e., the ability to satisfy people's preferences 
by means of economic prof ileration. 

ISR is not enough to ensure that information will surprise anybody. We believe 
that a mediating profession is needed to create conditions which will maximise the 
surprise value of the retrieval of data, and data remains data until it surprises 
someone. Until it gives you some information that you had not expected. In other 
words, the important problem is to ensure that people will pay attention to the infor- 
mation by arousing their curiosity. 

In our research we have used a variety of observational techniques; We are trying 
to develop new ones as we go along. Some we have used are semantic differentiation, 
attitude instruments, diaries, socio-metric devices both in the group and in the com- 
munity, interaction analysis and the content and context analysis video tapes. We 
use video extensively. All group meetings are video-taped and analyzed for behavioural 
patterns. Mass communication is built into the model - radio, television, newspapers, 
etc. 

I might add that we can play the simulated roles, the dynamic roles of County 
Commissioner, activist groups, business and industry, information specialist, floating 
librarians. You name it, we can play these roles both 0*11 and off-line. For research 
purposes, the off-line role playing is much more valuable. You can observe the be- 
havior taking place in actual human actions. 
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In summary, our major purposes are first, to provide a matrix within which the 
effectiveness of library and Information storage and retrieval can be evaluated; and 
secondly, to Identify the knowledge, attitudes and skills of a library communications 
profession. This Is our grand purpose, our overall objective, our hope to deal with 
the "Gestalt' 1 of human activity. Sounds global? It Is. However, 1SR Is quite 
manageable. It Is finite and It Is bounded, and I as afraid most librarians and most 
Information science specialists want a finite, bounded world. But I must point out to 
them that It Is a highly artificial world, particularly from the view point of people 
who live in communities and with community problems. 

QUESTION : To Prof * Wagner* Does the serial Jump strategy you developed have practical 
applications already? 

PROF* G* WAGNER (Germany) : i think it is very difficult to give you general solutions 

to the problems of table look-up techniques. It depends on what you are doing which 
strategy will be best. In most cases this Is done by trial and error; comparing dif- 
ferent strategies and seeing what takes longer and what takes less time. You then take 
the strategy which proves to be the quickest one. Because so many parameters go Into 
these search strategies. It is very difficult to give general solutions. What we had 
was one practical problem - the translation of words In strictly alphabetical order 
from one language to another. For this, we found this solution. But for other prob- 
lems you will find other optimal solutions. 

DR* COCKX: To Prof • Wagner • Do you mean that you can generalize your method to other 
disciplines? 

PROF* WAGNER: To Dr* Cookx • It Is not restricted to the medical field. It Is trans- 
lation of words. The SABIR system was set up In Paris with the help of our own 
Institute In Heidelberg. The center of the SABIR system Is still In Paris and we 
are collaborating on the creation of an information network. The first axis of this 
Information network Is Paris - Heidelberg and other axes have since coma Into being 
such as Warsaw and Belgrade. The principle of the SABIR system, which Is restricted 
to cancer literature. Is that each country which would like to participate comes In 
with Its own national literature In the field, puts It Into the system and receives 
the total of the system. I think It Is very economical. The central point Is still 
Paris where most of the work Is done. We In Heidelberg put the German literature 
Into the system. But we have also been collaborating on the preparation of the system. 

MRS, A*- « ROSEN HECK (Iorael): To Prof* Wagner* Who funds this project? Does every 
country which participates also shoulder part of the budget? 

PROF* WAGNER: To Mrs* Roeenheok * We pay our costs for putting the German literature 
Into the system. The French pay not only for the French literature at the moment, but 
also for the English and American literature because we have no partners yet In the US 
and UK. We hope to in the future and then they will pay for their own national lite- 
rature. My belief Is that every country should be glad to come Into a large Inter- 
national system. It Is much cheaper to pay only for the national literature than for 
the total of the system. 

DR* COCKX : Is It preferrable to retrieve documents by searching unprocessed text or 

matching descriptors assigned to documents and queries? I would like to know what is 
the future of automatic indexing and automatic translation? Can users' needs be met 
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better by rapid diffusion of title lists, by group profile services or by relatively 
slow, but high-recall/high-relevance retrieval services? And, under which conditions 
can the user benefit from on-line, time-sharing terminals? 

DR* W* E* BATTEN ( UK ) : To Mr* Aitchison * You used the word "best 11 and carefully inserted 
Inverted commas around it* In view of what followed from Mr* Bloch it might be worth 
our while if you would spare just a moment to tell us what in this particular set of 
experiments was your criterion of "best”, 

MR* AITCHISON : To Dr* Batten . How do we measure "best" performance? We were talking 
all the time about recall, fallout and precision, so we were getting the normal 
Cranfield curves* We got series of curves and if one curve is more to the top right 
than the other, we know the recall precision all the way through is better. It is 
when the curves come close that one has arguments* Essentially all the performance 
measures were based on recall precision and fallout* But we produced curves in 
different ways* 

MR. BLOCH: To Dr, Batten* Since we assumed that the number of terms will not be too 
great, we considered that the more terras the better* I stressed before that we assumed 
the number Is not so large that it becomes noisy again. We based this on the fact that 
we do not extract from the full text but only from the abstract where we may assume 
that only relevant information has been given* 

I mentioned that we assume that the greater the number of terms the greater the 
chance of an entry point for retrieval* This may increase the noise; however, we feel 
that abstracts will very seldom be noisy* 

MR. B * VICKERY (UK) : To Mr* Aitchieon* I believe that not many of the abstracts used 
in the INSPEC system derive from author summaries* If this is so, is this a factor 
affecting the precision of the search using the abstracts? And secondly, did you 
collect information about the amount of overlap between the documents recalled by the 
different searches? I have been connected recently with one system using titles and 
one using descriptors to search the same base* Both performed quite well, but the 
overlap between the two was relatively small* They were recalling different lots of 
papers. 

MR* AITCHISON : To Mr* Vickery . We tend in INSPEC to use author abstracts and data 
them as far as we can* Obviously a certain portion of the abstracts are produced 
separately because either there is no abstract or it is not suitable* 

On overlap, we find quite a difference* In general documents cither tend to come 
out or not to come out* If a document has a tendency to come out, it would come out 
with a fair number of the different languages* In other words, at each time there 
was this sub-set of documents that come out, you were topping up on the different lan- 
guages* If you think of it, there is a reasonably good reason for that* One can 
imagine the subject is clear and everyone latches on to it. 

MR* S * PASCAL ( Israel) ; To Mr, Aitchison and Mr. Bloch , When you speak of titles and 
abstracts, I understand that you mean that no treatment has been made in the input* 

That means the file contains titles or abstracts that are searched in natural language 
without any morphological or syntactical analysis* Does this mean that you didn't use 
such instruments as thesauri? If in the search you used thesauri or other aids, I 
cannot see precisely what the different is between this and the system you have 
described. 
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MR* AITCHISON : To Mr* Rascal* It is not the same to produce a controlled language or 
use a thesaurus. The thesaurus that provided the controlled language had been developed 
in setting up the data base 9 whereas the thesaurus that we were using for the natural 
language was the EJC Thesaurus and various others. 

MR* W* URLMAM (Sweden): To Mr* Aitohison and Mr* Block* I think one doesn't take 
sufficiently into consideration, considering computer costs, that a search using un- 
controlled vocabulary is considerably more expensive than a direct text search. 

Secondly, I think one sometimes oversimplifies the problem because, for example, 
if I use card string search into a character string, title for example, it is quite 
different from using a character string search in a string having some thousand charac- 
ters such as abstract. The second needs much more character string routines. It is 
not sufficient that one sees that these character strings are simultaneously present 
in the abstract. The first might be in the beginning of the abstract and the second 
might be at the end, and they ray have nothing to do with each other. In such cases 
it is essential to be able to state that these two character strings shall be in the 
same phrase, in the same sentence, in the same paragraph and shall be separated from 
each other by no more than, let's say, one word or three or four blanks and so on. 

This in its turn means that these character string teams become considerably more 
expensive. 1 have experience with two fairly large data bases. I can get from both 
roughly the same relevance figures. 

A third point is that it depends very much how structured this data base is. If 
you have subject headings and categories and key words and abstracts, it is much 
easier to formulate reasonable and sensible questions, and it is much cheaper. On the 
other hand, if you have an unstructured data base you can get about the same results 
with more advanced routines if you haven't got only titles, and that depends also on 
what kind of title. But the unstructured data base demands approximately twice as 
many terms in the question as the structured data base, and that is a very considerable 
increase in cost. With controlled vocabulary you know what you have to ask. Let's 
take a question: computer traffic. On the open data base I search under the subject 
heading computer software, for example, only with the character string traffic. Ask 
it in an unstructured data base. You can imagine what you have to enumerate to get it. 

MR* AITCHISON : To Mr* Uhtmann* Control versus field text in processing - I agree that 
one will be more efficient than the other, in computer time. We did not do context 
searching, but we made a somewhat abortive attempt to see whether context searching 
would have helped us, for instance, to overcome the fact that we got very low precision 
in the abstracts. In general, and that was on hindsight, the same sentence was going 
to help the precision, but would reduce the recall. It was surprising the number of 
times that what you wanted would only come out in the abstract because there was one 
term at the beginning and another at the end. 

Regarding the structured data base, I entirely agree. Because the data base 
obviously only covers a certain, if multidisciplinary, field. Since we have the 
classification imbedded in our data base, it means that we can combine the classifi- 
cation with free text searching and make sure that we have partitioned the data base 
so that the use of the word will be very precise although it is nstural language. 

MR* BLOCH; To Mr* Daeoal and Mr* Uhtmann* What we wanted to know is not whether this 
will perform well, because this obviously is a question of the kind of system you use. 

In order thst the system can work you must have a certain amount of input dsta. What 
we wantsd to know is whst input data will be sufficient in order that with fsirly 
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sophisticated techniques we will be able to retrieve something* We wanted to know 
whether it is enough to have the indexing terms or whether it is enough to have the 
abstract , or whether we must use both because they do not overlap sufficiently* After 
we have the data base we must use a retrieval system which is sufficiently sophist!* 
cated to differentiate between character strings that are consecutive and those that 
are not consecutive* We were fully aware that you cannot work with pure Boolean 
algebra on an abstract* In 1967 we already had what we called end terms* A certain 

, number of words have to be consecutive or must be within a given number of words or 
must be within a phrase or must be within a section* And only by using sufficiently 
sophisticated techniques can you go to an abstract* It will be disastrous to go with 
a simple Boolean algebra system into an abstract* There is no question that when you 
want to work on a long character string you must provide sufficient sophistication* 

Our research was based on the fact that you have indexing terms and abstracts and you 
want to know which is better* But we wanted to go into production and not research 
only and we have to live on data bases we buy from abroad* Most of them are unstruc~ 
tured, and therefore we have to develop our own systems* Title is very nice as long 
as you work from highly scientific subjects but the moment you come to engineering, 
the titles have no information* 

QUESTION: To Mr* Aitohison and Mr* Blooh t Several years ago Bloomfield of Hughes 
Aircraft Co* did some studies on comparative retrieval effectiveness taking terms 
from titles and from vsrioua abstracts* He did comparative evaluation on retrieval 
effectiveness* I Just wondered whether his work fits into this* 

Mff* AIT CHI SON: We have seen most of these studies* Our feeling was that the Cranfield 

approach was the moat productive, although it haa its critics* One of the reasons we 
were happy to continue in that particular way was that none of the critics seemed to 
have produced an altarnative method which any of the other critics accepted* 

1 W* BLOCH: The ides was to find out whether abstracts or indexes slons srs sufficient 

for your system through methodology which can be applied by anyone with two rasa on ably 
intelligent persona, one who knows English well and one who ia a subject specialist* It 
is methodology onlyu For every data base it depends on what the indexing end abstract 
.policy is* Only that decides whether the index or the abstract is better er whether 
’you cannot use one without the other* 

HRS. 4*M* DE BUSTAI4ANTE (Nexiao) : Eegarding the question of whether abstracts alone 

are sufficient, I would like to recoil an article by R.L, Ackoff describing a wary 
interaatlng experiment in which a certain nuaber of publications eelectad by a special 
committee of the Univeraity of Pennsylvania were given to a number of students in two 
forma* Firet, abatracta.of articlaa classified by the authors as good or poor articlee 
were given to two groups of graduate students* The same authors then performed a test 
for content comprehension of the abatracta of tha articles* Students ware then given 
both tha abatracta and the complete article* A further comprehension teat showed 
that on articlee of poor quality the comprehension of the students was better when 
they read only tha abatracta; whereas on the good quality articles the students had 
better comprehension when they read the complete articles* 

MR, NIEVERMYR (Gemany): To Mr. AitohtBon* How large a nuaber of items did you 
conaider in meaauring your performance and if the number was very large, how did you 
practically meaaure recall, since thu procedure ie very time consuming* 
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MR . AIT CHI SOU: To Mr . Niedemeyr . We took advantage of our particular situation. We 
had at the time 600 users of our SD1 service. They had been receiving weekly outputs 
and also lists of all items in the data base. So from them ve had all assessment of 
the relevance of some 1,200 documents. We then asked them to submit a question in 
the context of their SDI profile. All ve had to do now was to supply them again with 
a copy of all the documents that they had already assessed as relevant to their profile. 
This gave us a relevant subset. Thus we were able to ask them on the average to look 
at the assessment of relevance of some 30 documents. 

Mi?. H, FANGMEYER (Italy); To Mr . Aitchieon . When you compared the abstracts and in- 
dexes, did you also foresee automatic indexing? We had performed some tests on automa* 
tic indexing in the framework of the nuclear documentation system. If we speak about 
Indexing, we should also speak about automatic indexing, because we found that it Is 
preferable to manual indexing. We found that the consistency between manual and 
computer Is about the same as between manual and manual. On the basis of more than 
500 documents we found that the recall and precision curve is much better for automa- 
tic indexing than for manual indexing. 

Mi?. AITCHI50N : To Mr Fangmoyer . It seems to me that the one part that the computer 
does not do well is to find out which of a large number of words in a paper are going 
to be the most relevant. I thought that this had been the experience in the automatic 
Indexing experiments that have been carried out in the US. 

MR ♦ S. ISAACSON (Israel): I think two points have not beer brought up. The first 

point is the Indexer himself. If he is an expert in the field, the chances of relevance 
being high according to the retriever is going to be high. In several of the informa- 
tion centers I was associated with, we found that if we need an expert to do the in- 
dexing ve usually got very high retrieval effects. The number of terms used varies 
from subject to subject, but on the average we found about 5 to 7 terms adequate. There 
is one other point. If one of the index terms is the category, and if you can set a 
weight on some of the other terms, then your relevancy of retrieval will be high. If 
you want to get radar measurements of the moon and the category is radar, and you set 
a very high retrieval margin on the word "moon", then certainly you will get very high 
retrieval even with a low number of terms. 

MR . BLOCH : To Mr. Isaacson • We were trying to find out what happens when you use 
commercially available data bases. If the service from which you buy your data base 
employs highly specialized Indexers, very possibly the indexing will be of such a 
high quality that it will be sufficient to use indexing terms only. We wanted to de- 
velop a methodology which could be used every time you want to incorporate a new data 
base in your own services. From the experience Mr. Lancaster hail when he checked the 
MEDLARS system, it appears that It Is extremely fruitful If the Indexer is on the spot 
working with the search team because there is a kind of cross pollination, since the 
searcher knows what the indexer has put in and the Indexer knows how the search will 
be done. But that can only bo done if you use a data base where you do your own in- 
dexing and your own search. Many c£ us use systems like that, In industrial firms 
where you want to use your own classified material or In a large international 
Installation like MEDLARS. Until Lancaster came along, MEDLARS kept the indexers 
•nd searchers completely separated and ware surprised when It turned out that one 
didn't know what the other was doing. 

We worried for a long time about weighted systems, but we decided on the Boolean 
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for two reasons. It makes for a faster through-put because the Moment one of the end 
tens of the profile is not fulfilled, you go directly to the next profile. Also it 
is More difficult to get a customer to make a weighted profile than a Boolean one. 

Tha scientists can feel what happens when they make a Boolean profile, even if it is 
a sophisticated one with strings of characters that must be together. When you ask 
them to put wieghta on, thay feel completely lost. Recently I was teaching students 
on the use of computers in information work and I was surprised that even in teaching 
It was difficult to make them understand how the weighted system works. But it was 
very easy to explain how the Boolean system works. 

I muat mention one advantage to a weighted system. When you use a weighing sys- 
tem, you can update your profile automatically. So far we have not found a way to do 
that with the Bool tan system. If someone here has some idea how to do that with 
Boolean, I would be extremely grateful to hear about it. 

MR, C, KEREN (leraal): I would like to offer a suggestion concerning a system we 

are trying out just now. Wc take titles of articles and reports which have actually 
been requested by customers and extract from them the key words and compile a list, 
of key words which we hope might later be useful. We think in fact they are useful 

as an aid to profiling and to asking retrieval more effective. This is still very 

much experimental, but I think that our system might to a certain extent be an aid 
instead of automatic, indexing. 

MR, PASCAL: To Mr, Hjnrppo , I*d like to know why the evaluation took so much time? 
About two years for each case study. 

MR, HJERPPR : To Mr, Daeoal, Case studies which are included were selected for three 
reasons. First, the customer should make an evaluation over a long period. Second, 
the case should concern different subjects; and third, it should concern different 
types of users - government, industry, etc. 

MR, J, DE LA C IE MAN DIE RE (France) : To Prof, Panland, A propos de 1' excel lente analyse 

du processus de la communication faite par M. Pei; land, Je dcsirerala formuler deux 

remarqueat - be processus de eotmm lent ion est presque toujoura schematise par les 
interactions: Emettcur-Keceptcur, sans toutefols que l’un se preoccupe beaucoup du 
contenu du message, sinon pour des raisons de cout de transmission. 

Co pi obi erne eat aaacc bien resolu par la theorie de l'lnformatlon (information 
theory) qui a anticipe celle de l 1 inf ormation sclentlf ique, Toutefoia, en dehora de 
sujetiona materiellea, cette theorie se revele a 1 'experience peu fc condo pour ce 
que nous nvons a falre. 

- Seconds remarquet Monsieur Penland propose une approche cybemitique de la 
communication, qui s'avere plausible bien d’avantage au niveau dea structures, qu'a 
celui de la signification. Ne rappclle-t-il point quo "l’homme eat plus frland de 
structures que de pain"? 

Son allualon a la "thaorie dea systems" stipule, que la structure represente, dans 
une tranche d'eapace et de tempt, l'cquillbre an function d'un but. Cette definition 
nous conduit a penaer que la structure est une information (LAB0R1T) , mala il nous 
rests a connaltre la signification de cette structure. Et a l'hsura actualle, las 
ayatemea de traltcment de 1* information qui se redulaant a des systems de traltement 
de flchiere, pratiquament depourvua da syntexe et de valeure aemantiquee, ne noua 
permettent pas da noua fairs une Idee dea significations. 
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Ces systems sont blen elolgnea des h oncost at a sux quel M* Pen land fait allualon* 

La raison de cette divergence reside dana un« double confualont confusion qua l 1 on 
commet entre 1* informat ique et l f information, et confualon que l'on entratlent entre 
lea ordlnateura et l'lnformatlque* 

Cette double confualon Juatlfe du meme coup lea KWIC, lea doublea KWIC, lea pro- 
ceaaua de dlffualon selective d' information, de diffusion automatique de publication 
secondalrea, lea reaeaux documentalrea non optimises, la linguist Ique atructurala, 
tous produlta de notre "technoetructure" qul melange dellclauaement neceaaltea et 
aspirations sous le notn de beaoina* Pour ma part je regrette que noua aoyona tombaa 
dana le plege des bcaolns, ce qul noua amen a a effactuer un certain nombe da tachea, 
qul ne se justlfient paa, mala servent prlnclplement a noua Justlflar* 

On a tendence a tralter les documents comme dea produlta Induatrlela mala alora 
que l’on connalt avec precision, a la foia, la structure, le contenu, la signification 
et l'usage d’ elements atockea en con teneurs, lore qu f ll a 9 aglt de "mcmolras documen- 
talres", on Ignore la plupart du temps la signification de ce qu v ellea contiennent. 

Cette carence eat prlnclplement due au fait que laa analyseura - prograMeura 
Ignorant lea motlvatlona dea utlUaateura et que lea utlllaataura maconnalaaant eallea 
dea analyseura* Notre schema: Eaetteura-Recaptaura, s' 11 a'aglaaalt d'un veritable 
dialogue: homme-machlne , devralt comporter lea feed-back d’autoragulationa naceaaairea, 
dont lea ayatemea qul noua aont proposes, aont ganeralement depourvua* 

La communication de M* Penland eat au coaur da nos preoccupations* 11 en eat de 
meme dea travaux d’Oagoo^ et d’Hayakawa qul noua ont aide cn France a fcrmuler lea 
premlaaes d'une theorle paychologique de 1* information* 

PROF. PENLAND : 1 want to atart off with that black box which repreaentr ISt. It 

rapreaenta tha sum total of recorded knowledge and unless knowledge la recorded It 
la not In the black box* Documents ara arranged by content In a way that repraaenta 
the totality of knowledge* In addition to that, the librarian-information aclantiat 
doea a curious thing* He transforms that document atore Into a homophorlc aet of 
daacrlptora, backed up by abstracts In thia ayatam* And tha so-called aclance of 
Information aaya that these documents have been aciantlf icelly transformed Into a 
aat of abatracta and descriptors* 

If you have transformed these documents into daacrlptora, why can’t you racon** 
struct these documents from the descriptors? The answer la "Well, you know we can't 
do that"* This, 1 think, aaya something about tha bounded finite ayatem of library 
and information science* 

Vainer developed the adaptive control mechanism ACO* The only thing that will 
tll^ger this adaptive control mechanism la something that surprises It* If It isn’t 
surprised, it won't pay attention* 1 Platen to the sound of the air conditioner * 1 

don't pay any attention to It because It doesn't surprise me* But If 1 were to hear 
music coming from the air conditioner, this would surprise me and get my attention* 

Until information has that surprise element you can have all tha libraries and infor- 
mation storage unite in the world, but they won't mean a thing to anyone unless thare 
la aoms aurpriaeful relationahip between the data and the individual* 

Ualng thla kind of communication model, where 1, tha receiver, go to a number of 
aandara who coaronicete to ma, and go through a mapping process and aaarch for 
serendipity and if something surpr/leee me, 1 pay attention* Thla la a modal diametri- 
cally oppoaad to tha usual one, even tha one uaad by Shannon, which la the kind of model 
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where you have • sender and many receivers. Most communications theory is based 
upon thst kind of one-way model. However, the library and information science pro- 
fession Is based upon another kind of model , the potential of which is not completely 
realized. I initiated this theory and will follow it up only as long as I am getting 
some data that surprises me. 

MR, BLOCH : To Prof • Penland . I just want to remark that this idea M if it surprises 
me | it’s information" is actually what Shannon did. But he says as long as you can 
predict it there is really no content in it. And it is only when something surprises 
you that you cannot predict, you call it information. 

PROF, PEN LAN D: To Mr, Bloch • Precisely. Information centers have all the data in 
the world; I don’t know whether they have any information at all. 
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SYNOPSIS 

Information science education has developed into a variety of 
forms in different countries. An analysis of these educational 
programs and their developing tendencies toward a .more uniform 
set of practices suggests that a common pattern for information 
science education is emerging. This pattern is described and 
conceptualized by means of a two-part classification scheme-- 
one dealing with the levels of competence of the trainee and the 
other with the orientation and content of the courses. 



As one investigates the programs and procedures for education in 
information science, one is impressed with the diversity that 
exists. As one searches through the literature, problems arising 
from diversity are quickly encountered. Under what subject heading 
should one look? In the United States, the commonly used name is 
"information science". In the USSR and kindred countries the pre- 
ferred term is "informatics", and in many European countries, the 
tendency is to use the term "documentation". The FID/TD Training 
Committee, in whet we may assume to be a spirit of compromise, has 
published its very excellent report under the title "A Guide to the 
World’s Training Facilities in Documentation and Information Work" 
[ 1 ]. 

In addition to the differences in the name, there is also a dif- 
ference in the place where the training takes place. In some coun- 
tries information science education is connected with schools for 
librarianship. In other countries special institutes have been 
organized to provide training for documentalists and other informa- 
tion personnel. The level of training also varies; it may take 
place at a secondary level, at a technical college or at a univer- 
sity. 

The cultural, economic and political regime, particularly the 
difference between socialism and capitalism, influences all aspects 
of national life including scientific information and education. 

In a very enlightening article, Professor Majewski of Poland points 
out that [2 , p . 15] : 
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"In all countries with a socialist system of economy 
three main scientific sectors can be discerned, each 
possessing their own scientific and research institutes. 

There are scientific and research institutions existing 
within the framework of the 

(a) national academies of sciences 

(b) academic schools, 

(c) industry, services, organs of health protec- 
tion, etc." 

There are also different levels of training, and Professor Majewski 
goes on to explain that documentation technicians are graduates of 
the higher secondary schools who have, in addition, completed 
special documentation school courses of two years in length. Doc- 
umentalists are trained in the academic schools and by means'* of 
extramural courses which provide instruction on the aims and uses 
of scientific information as well as in the skills of processing, 
classifying and disseminating documents. There are also post- 
graduate courses, and in the USSR one is able to obtain the degree 
of "doctor of information science". 

The importance of the embedding educational system and how this 
effects schooling in documentation can be easily seen through the 
training policies and trends in The Federal Republic of Germany. 

In 1965 Professor Cremer wrote [3] that, "Because of the different 
education systems in the German gander 1 and the fact that the 
teaching of librarianship is not included in the university cur- 
ricula in Germany but takes place in the so called ’Bibliothekarlehr- 
institute ? ". . .it was necessary to promote training activities at a 
national level. Such training became the responsibility of the 
Deutsche Gesellschaft fur Dokumentation in Frankfurt. This society 
offers special courses, examinations and a diploma. 

More recently, at a meeting of the FID/TD Committee in April 1970, 

Mr. Marloth described some current educational developments in the 
Federal Republic of Germany [4] . He pointed out that now a clear 
distinction is made between: 

(a) education and training of librarians 

(b) education and training of documentalists , and 

(c) education and training of information scientists. 

The educational requirements for documentalists, at the highest 
level of competence, are a master’s degree, good knowledge of the 
English language, one year of practice in a documentation center 
and a one-year part-time course at the Lehrinstitut of the Deutsche 
Gesellschaft fur Dokumentation. At the completion of training, the 
documentalist should be qualified to assume a leadership role in 
documentation centers. 
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For information science, Mr. Marloth explains that at present there 
exists no regulation curricula in German universities for this sub- 
ject; however, a study group has been commissioned to examine the 
form and content of such educational training. It is anticipated 
that a master's degree in information science will be established 
and that this will be followed by a doctoral program which will be, 
essentially, a research degree. 

Current training programs in the United Kingdom have been described 
in two articles, one by Farradane [5] and the other by Schur [6]. 
The latter paper is particularly noteworthy for it mentions some 
of the problems in information science education that are faced in 
the U. K. and, as the author says, no doubt in other countries as 
well. Included among these problems are the following: 

(a) It is difficult to interest a sufficient number of 
good young scientists for training as information 
specialists; 

(b) Existing first degree, and even post-graduate 
programs, are of a general nature lacking an 
adequate balance of specialized courses in, 
for example, mechanized information retrieval, 
reprography, management, etc.; 

(c) The expansion of courses in information science 
is limited by the shortage of qualified teach- 
ers; 

(d) Research and development in scientific and 
technical information takes place outside the 
schools of librarianship and information 
science, and there is a problem of how to 
integrate the results of this work quickly and 
efficiently into classroom teaching. 

Efforts are being made to alleviate these problems, but the solu- 
tion is still a long way off. 

It is tempting to continue in this vein and to discuss the 
separate educational programs of other countries such as Japan [7] , 
the USSR [8], Canada [9, 10], the United States [11, 12, 13, 14], 
Israel [IS] , and the countries of South America [16] and Africa. 

It is tempting, but the cataloging of differences will not be par- 
ticularly enlightening nor can a brief mention do justice to these 
individual developments. One can easily become impressed with the 
differences that are present and loose sight of the similarities 
that are equally prevalent and which are, perhaps, of greater 
importance. 
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The diversity and experimentation in educational procedures that 
exist are part of a necessary stage of development, but now as inter- 
national activities expand there is a need to increase the similar- 
ities among the training programs and decrease the differences. As 
educators from different countries exchange experiences, and as they 
become more familiar with each others f achievements, there will be 
a mutual influencing of the training programs, of courses, text 
books and teaching aids. Information is an international commodity; 
no one country has a monopoly. Information must be disseminated 
across boundaries, and international cooperation is essential. The 
desire for a free exchange of information contributes toward coop- 
erative efforts, and these in turn bring about a greater uniformity 
in the training programs and in the competence of the information 
specialist. 

Efforts for furthering cooperation and for raising educational 
standards are facilitated by such organizations as FID, UNESCO, 

ICSU/AB, and others. Not only do these organizations provide a 
forum for the exchange of ideas, but they engage in projects that 
have an international scope. A project of particular significance 
for information scientists, and one which will have a great effect 
on the standardization of educational training is UNISIST. The 
UNISIST program envisions [17, preface pp. VI and VII] : 

"The unimpeded exchange of published or publishable j 

scientific information and data among scientists of 
all countries; The cooperative development and 

maintenance of technical standards in order to j 

facilitate the interchange of scientific information \ 

and data among systems; The promotion of cooperative \ 

agreements between and among systems in different j 

countries and in different areas of the sciences for 1 

the purpose of sharing workloads, and of providing j 

needed services and products; and the development i 

of human and information resources in all countries j 

as necessary foundations for the utilization of ma- \ 

chine systems." i 

| 

Among the recommendations that the UNISIST study committee made is j 

one [17, pp. 56-57: recommendation for #13], which deals with man- 
power development. j 

l 

"For all nations to take an active share in the | 

operation of international information systems, a j* 

concerted effort is needed to provide information J 

specialists, librarians and documentalists , with | 

improved educational facilities... Attention should 
be given to the desirability and feasability of inter- 
nationally oriented training and educational assist- 
ance programmes, which might include proposals for 
pooling resources, where needed, in a number of re- 
gional education centers." .* 
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What then are the common trends that can now be identified as one 
studies the educational programs of information science in various 
countries of the world? Can these programs be catagorized and 
classified? The questions are difficult ones to answer because the 
programs are still evolving and any attempt at classification will 
necessarily have to ignore significant individual differences* 
However, all programs are not unique; similarities do exist and 
more will emerge. Perhaps the best way of escaping the dilemma is 
to avoid classifying the existing education practices and to de- 
velop a conceptual taxonomy which will help us to understand the 
full range of programs in information science education that seem 
to be developing. With this in mind, I would like to suggest a 
two-part classification scheme; one dealing with the levels of 
competence of the trainee and the other with the orientation and 
content of the courses. 

Education in the information sciences, documentation or informatics 
will be seeking to train people at three different levels. These 
are : 



(a) the Information Technician: Requiring a 

baccalaureate or first degree 

(b) the Information Specialist: Requiring a 

master T s degree 

(c) the Information Scientist: Requiring a 

Ph.D. or equivalent doctorate degree. 



Education at the technician level will be of a very practical 
nature emphasizing the analysis, organization and dissemination of 
information by both manual and automated techniques. People with 
this degree will make up the majority of the workers in documenta- 
tion centers. 

The information specialist is equivalent to the professional 
documentalist . He will receive more intensive basic training plus 
additional training in languages and will acquire competence in a 
subject specialty which may be one of the sciences or management. 
Graduates of this program will be qualified to take a leading role 
in the organization and management of the documentation center. 

The doctorate degree in information science is a research degree, 
and the information scientist will be qualified to conduct research 
and to teach. 

Four different orientations to the organization of an information 
science curriculum can be identified [13]. These are: 



(a) the library oriented curriculum; 

(b) the computer science oriented curriculum; 

(c) the systems oriented curriculum; and 

(d) the information science oriented curriculum. 
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The library oriented curriculum tends to identify information 
science with documentation,, and these programs are frequently 
located in library schools. The emphasis is on the use of com- 
puters in libraries, and the course of study includes library 
automation, circulation control, on-line cataloging, etc. Also 
included are courses in information retrieval and dissemination, 
automatic indexing and abstracting, and the automation of various 
library functions. 

The computer science oriented curriculum places primary emphasis 
on the computer. Such programs are generally located in engi- 
neering schools, and the course of study stresses the mathematics 
and logic involved in computer hardware and software design. Ap- 
plications are also a topic of study, but these too emphasize 
engineering and mathematics, as in the use of computers in simula- 
tion studies or to represent recursive functions. Other applica- 
tions that might be included are linguistics, information retrieval, 
data base organization, public health, etc. 

The systems oriented curriculum emphasizes the methodology . of 
systems analysis as it applies to one or more institutional environ- 
ments.' The curriculum deals with the analysis and design of infor- 
mation systems, information retrieval, library networks, etc. Stress 
is placed on the management and decision-making aspects of infor- 
mation systems rather than on the operational and service needs. 

The information science oriented curriculum is based upon the premise 
that information science is a distinct discipline with its own 
methodology and research interests. Schools having this orientation 
emphasize theoretical courses drawn from the formal disciplines of 
mathematics, logic, and perhaps linguistics. Courses in computer 
utilization, language processing and data base management systems 
are also included. The emphasis of the entire program is theory, 
methodology, and research. 

These are the discernible trends in information science education. 

The course structure may have any one of four different orienta- 
tions, and within any given structure, students may receive training 
at three different levels of competence. 

The challenges facing education in the information sciences of this 
particular moment in history are very great. The universities 
throughout the world are in a furment. Students everywhere are 
clamoring for more relevance in their courses of study. However, 
information science is in an enviable position. What can be more 
socially acceptable than studies that lead to improved methods for 
gathering, processing and disseminating information to all who need 
it and for the betterment of all mankind? 

The study of information science is now in its infancy. The pattern 
for its future development is reasonably clear. It is up to us, the 
educators in this fieid, to nuture and accelerate the growth of 
information science into a fully accepted discipline with its own 
body of theoretical knowledge and useful application. This is the 
challenge that we face and that we must meet. 
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TRAINING IN LIBRARIANSHIP AND 
INFORMATION SCIENCE IN ISRAEL 

Lydia Vilentchuk 

Head, Dept* of Training and Publication 
National Center of Scientific and Technological Inforraaticn, Tel Aviv 




SYNOPSIS 

Development of education in all trends of librarianship and 
information science on non-graduate and graduate levels in 
Israel is reviewed, and statistical data about graduates for 
the last 14 years is given* Forecasts of needs in trained 
man-power for the next 10 years are made. Importance of edu- 
cation of users of information from an early age is stressed* 



To given an indication of the number of potential users of the services cf special 
librarians and information scientists, as well as the required trends ani depths of 
their training, let us first look at some pertinent Israeli stati8tics*l|2,3 



Total population in 1971 
(approx* 17% non- Jews) 

Primary schools 

High schools 

Institutes of higher learning 

University- trained manpower in 
science and technology, including 
medicine and agriculture 

Master's degrees granted in 1969 
in science and technology, including 
agriculture 

Doctor's degrees granted in 1969 
in science and technology, including 
medicine and agriculture 



approx* 3,000,000 

1,442 (456,356 pupils) 
246 (138,850 pupils) 

7 ( 37,343 students) 

approx* 25,000 

400 

220 



School libraries - exact figures not available 

Public libraries 611 

Special libraries - 283 

Academic libraries - 7 



I 



( 

f 
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These figures provide the statistical background to the subject. However, there is a 
great gap between the number of potential and actual users of libraries and informa- 
tion services. The Israeli public lacks habits and training in the use of libraries 
and recorded information. This can be explained by the curricula of the primary and 
high schools of the country, which do not provide systematic training in the use of 
libraries and thus do not foster informatiori-mindedness from an early age. The great 
influx of new immigrants with their language difficulties and different backgrounds 
in schooling and reading habits is also not conducive to the creation of information- 
mindedness among the population. 

Most secondary schools, primarily administered by the local authorities, have recently 
established libraries, but most of these libraries are not in the charge of trained 
librarians and no formal orientation classes in the use of libraries and of published 
material are offered to the pupils. The situation is even worse in primary schools, 
which are administered by the Ministry of Education and Culture, In most cases these 
schools do not have any libraries at all, as the Ministry does not earmark special 
budgets either for library premises or for the salary of a librarian, and only small 
budgets for book purchases. The collections are in the charge of a teacher-librarian 
who devotes not more than 8-12 hours per week to this work. It is interesting to note 
that the Ministry considers the work of a teacher-librarian less qualified than that 
of a teacher, and equates one hour of teaching assignment to one and a half hours of 
a teacher-librarian^. 

It is curious that although Israel is among the most book-minded nations it does not 
yet have a Public Libraries Law, ai.d the provision of library services to the general 
public still depends on the goodwill and insight of local government bodies, trade 
unions or other institutions, which undertake to provide such services. However, the 
Ministry of Education and Culture has recently begun to establish public library net- 
works in various areas, especially those settled by new immigrants, and has prepared 
a draft law on public libraries to be submitted to the Knesset (Parliament). 

Until 1956 the only body providing education and training of semi-professional tftfcii- 
power for public libraries was the Libraries Division of the General Federation of 
Labour (Histadrut), whose activities in this field started long before the State of 
Israel came into being. The first courses offered by this Division were one-day 
courses, which later developed into two weeks Intensive boarding courses. Their 
syllabus included simple cataloging, Dewey classification, physical care of books 
(very important in a hot climate), bibliography of reference and standard books, 
with emphasis on Ju'daica and modern Hebrew literature. 

An important step forward in the training of non-graduate librarians has been made 
by the Israeli Library Association, founded in 1953. In 1964 the Association intro- 
duced professional examinations in librarianship and is now the only body which grants 
recognised certificates of proficiency in the library profession on a non-graduate 
level. The Association is now engaged in n revision and enlargement of its examina- 
tion syllabus. Examinations take place at two levels: M A M general basic knowledge in 
library techniques, "B" a more advanced level with specialization in either public 
librarianship or special librarianship or special librarianship (see Appendix 1). 
Successful candidates are granted a certificate of Assistant Librarian” or "Librarian 1 
respectively. Between 1965 and 1970 the Association granted 170 certificates of 
"Librarian” in the public libraries, and 30 in the special libraries 5 . 



Trainuig in Israel 

Courses leading to these examinations are offered by various organizations: the 
Library Association, municipalities, and the Libraries Divisions of the Hlstadruth 
and of the Ministry of Education and Culture. For the "B" level examination In spe- 
cial librarianship the only body offering preparatory courses is the Extramural Stu- ' 
dies Division of the Technlon - Israel Institute of Technology, which In 1958 started 
evening courses for this purpose In Tel Aviv and Haifa. These courses became a 
regular feature; at first of one year's duration, they were later extended to two and 
a half yeara and now comprise over 300 Instruction hours. 

In the training of librarians at the graduate level, an Important development occured 
In 1955. Through the good offices of UNESCO, Prof. Leon Carnovsky of the University 
of Chicago Library School was Invited to prepare a report on the library situation In 
Israel^, On his recommendation a Graduate Library School was established at the 
Hebrew University In Jerusalem In 1956. The duration of the course Is one academic 
year and the minimum entrance requirement Is a Bachelor's degree. At present a dip- 
loma course, the management Intends to extend the studies to two years, leading to a 
Master's degree In librarians hip. This development Is expected to take place In the 
near future. In the second year of study the curriculum will accomodate different 
trends In llbrarianahlp; at present the School does not cater adequately to the train- 
ing needs of special librarians and certainly not of Information scientists. 

(kit of a total of 300 graduates during the 14 years of the School's existence, only 15Z 
are employed In special libraries, aa against 652 In academic libraries? • This can be 
explained partly by the curriculum of the School, which emphasizes conventional library 
techniques, sources of Information In the liberal arts and Judalca and the administra- 
tion of academic and public libraries (see Appendix 2). Partly It Is due to the educa- 
tional background of the students, aa the majority (68.72) possessed first degrees In 
the liberal arts and In Judalca, 23.32 In the social sciences, and only 92 In the 
natural sciences. No student had a degree In engineering or medicine. 

To fill the gap, the Center of Scientific and Technological Information of the National 
Council for Research and Development, In collaboration with the Welzmann Institute of 
Science In 1968 began to offer a postgraduate course In Information science. In 1968 
and 1970/71 (In 1969/70 no course vaa offered) 7 and 13 students respectively gradu- 
ated (13 la an estimate, aa final examination results are not yet available). 452 
of the students poaaeaaed degress In the natural sciences, 352 In engineering, 102 In 
social sciences and 102 In the liberal arts. 

This course la financed by the Ministry of Labour and the Ministry of Defence. The 
graduates are In great demand and more positions than students are available. All 
students found employment either with research Institutes or with Industry. However, 
because of budgetary problems and the difficulty in recruiting teachers, the course 
will probably be continued on a biennial basis only. 

As schools and Institutes of higher learning do not Introduce their students to the 
use of libraries and literature, all training In llbrarlanshlp and Information work 
In Israel has to devote a dlsproportlonally large part of the curriculum to basic 
Information on the use of catalogues, classification schemes, reference books, etc., - 
skills the student u should be familiar with before embarking on professional studies. 

The newly trained Information scientist will only seldom be able to rely on the support 
of an experienced librarian or have the benefit of starting hla career under the 
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supervision of a senior. A considerable amount of his time will be devoted to the 
supervision of routine techniques. This, and the development of oral and written 
communication skills - sadly neglected at all levels of education in Israel - have 
Influenced the shaping of the syllabus, (see Appendix 3). 

Some institutes of higher learning have recently begun to introduce library orienta- 
tion periods for undergraduates and it is hoped that in the near future such periods 
will become a general feature at all of them. The Center of Scientific and Technolo- 
gical Information last year sponsored two short courses in the use of chemical litera- 
ture, one at an undergraduate level at the Hebrew University and the other at a 
graduate level at the Weizmann Institute of Science (see Appendix 4) • 

The need for trained information workers on all levels in the near future can be 
evaluated from the following: 

The libraries Division of the Ministry of Education and Culture conservatively esti- 
mates that after the Public Libraries Law becomes effective, a total of 150 new 
librarians (both graduate and non-graduate) will be required to staff new public li- 
braries during the first five years, and approximately the same number for extension 
of services of the existing libraries and for replacements. 

How many professional librarians will be required to staff the primary and secondary 
libraries is difficult to predict. This depends largely on the readiness of the 
Ministry of Education and Culture to train pupils at an early age in the use of in- 
formation media. But even lacking the official blessing of the Ministry of Education 
it can be safely predicted that the number of school libraries and librarians employed 
in them will grow in the very near future. Presently only 2X of the Hebrew University 
Library School graduates are employed in school libraries'. 

Considering the need for information scientists, and starting from a cautious evalua- 
tion that for every 1O0 potential users - scientists, medical men, engineers and 
senior management personnel - one information scientist will be required during the 
next 10 years, 250 information scientists will have to be trained during this period. 
In this estimate the number of 25,000 potential users is taken as static; the yearly 
growth of the user community by approximately 650 equals the number of those retiring 
from active professional life. Any addition of users through immigration is presumed 
to be covered by an additional information scientist from the same source. A ten 
year period is chosen because the integration of information scientists into the 
economy of the country takes time, as the Israeli scientific and technological com*- 
munlty is not yet information-minded and therefore not prepared to absorb a large 
number of information scientists today. As to special librarians (see statist's 
p. 1) - these are presently very few (see statistics p. 2 and 3), and they are to be 
found mainly with research institutes and university and government departments. In 
industry only the largest organizations posess special libraries, and very few have 
information centers. But there is currently a change of attitude, and plans are 
underway, partly initiated by the Industry itself, to establish regional and/or 
subject or mission oriented Information centers requiring trained information workers. 

This leads to the conclusion that there is room in Israel for at leaat one additional 
graduate achool of llbrarlanshlp and/or information science. The first, oriented to 
academic and public libraries (including school libraries) , should train studenta 
with a liberal arts background; The other, oriented to special llbrarlanahlp and 
Information aclence, ahould accept students with a science background, and thalr 
raapactlva curricula should be planned accordingly. 
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ConalueioriQ 

From the experience gained during the 15 years of the existence of a formal training 
network In Israel (a country conventionally regarded as somewhere between a developed 
and a developing country) , the following conclusions can be drawn: 

a) Beginning at an early age, possibly from the primary school level, pupils 
should be Instructed In the use of libraries and Information media. 

b) Extensive training In library and Information science techniques should 

be provided on a non-graduate level - creating a cadre of good technicians 
to support the qualified reference librarian and Information scientist, 
and thus free them from routine library work. 

c) On a graduate level, different schools should be established for training of 
personnel for work In academic, public and school libraries, and for those 
for work In special libraries and Information centers. To try to embrace In 
one syllabus both trends seems inadvisable as they require entirely different 
educational backgrounds and different personal qualifications. 
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Appendix 1 



EXAMINATION SYLLABUS OF THE ISRAEL LIBRARY ASSOCIATION 
FOR THE ATTAINMENT OF A CERTIFICATE AS 
"ASSISTANT LIBRARIAN" AND "LIBRARIAN" 



A. "ASSISTANT LIBRARIAN" operative from 1972. 

The candidate has to be over 18 years of age; possess an Israeli matriculation* 
certificate or its equivalent, and minimum skill in typing in Hebrew or English. 

The examination is conducted in writing only and consists of four papers of 3 hours 
each. 

I. Cataloguing. 

(1) kinds of catalogues; (11) definition of terms in Hebrew and English; (ill) fi- 
ling rules; (iv) preparation of catalogue cards based on ALA rules - determination 
of entry and heading, descriptive and analytical cataloguing, serials and 
periodicals. 

II. Classification. 

By Dewey Hebrew Abridged Edition, which Includes special extension on Judaism and 
Israel. 

III. Basic Reference Tools for the Librarian. 

(1) part of the book: title page, contents page, introduction, main part, illustra- 
tions, notes, bibliographies, indexes, binding, cover; (11) various groups of 
reference books: general dictionaries, bibliographies (international, national, 
trade, etc.), encyclopedias, geographical sources of information, who's whos, 
periodicals; (ill) criteria for evaluation of reference books; (iv) techniques of 
reference work; (v) a list of 58 reference works the candidate should be familiar 
with. 

IV. Administration. 

(1) general knowledge about different kinds of libraries: national, academic, 
special, public, children's; (11) administrative, technical and reader's services; 
(ill) premises and equipment; (iv) the profession of llbrarlanshlp in Israel; 

(v) main international and national organizations concerned with llbrarlanshlp. 



B. "LIBRARIAN" in the Special Libraries trend. 

The current syllabus which has been operative since 1965 will remain in force till 
1972 inclusive. The new syllabus, operative from 1973, is still under delibera- 
tion, so that only a general outline can ge given. 

The candidate has to possess a certificate of "Assistant Librarian" and prove that 
he has worked at a recognized library for at least one year and that his knowledge 
of English is not less than required by the British General Certificate of Educa- 
tion 0 level. 



* Matriculation corresponds to the British General Certificate of education - A level 



Training in Israel 

The examination is conducted in writing only and consists of 5 papers. 

I. Cataloguing. 

Special problems in cataloguing. 

II. Classification. 

By UDC - Hebrew Abridged Edition, and Medium English Edition (when published) . 

Some knowledge about LC, Bibliographic and Colon Classification schemes. 

III. Bibliography. 

General bibliography on an advanced level, natural sciences and technology (incl. 
medicine and agriculture), social sciences and economics. Covers classical text- 
books, abstracting services, subject dictionaries, encyclopedias, who's whos, 
international and national organizations and their publications. 

IV. Management of Special Libraries and Information Centers: staffing, budgeting, 
reporting, Internal publications, reprography (systems, equipment and operation). 

V. Documentation. 

Principles of indexing and abstracting; literature search techniques and prepara- 
tion of bibliographies; principles and operation of manually operated feature and 
edge-notched cards; use of KWIC and KWOC Indexes; use of Citation Index; use of 
thesauri; basic knowledge about computer applications to library work; handling 
of collections of special material: reports, patents, standards, trade catalogues, 
etc. 



s 




» 
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Appendix 2 

SYLLABUS OF THE GRADUATE LIBRARY SCHOOL, HEBREW UNIVERSITY OF JERUSALEM 



The syllabus comprises 610 instruction (contact) hours and is divided into required 
and elective courses of 430 and 180 hourB respectively*. 



Required courses 

1, Cataloguing 

2, Library classification (general and Judaistic schemes) 

3, Hebrew literature > 

4, Bibliography (general, Judaistic, and humanities or social 

sciences or advanced Judaistic jar natural sciences) 

5* History of libraries and librarianship 
6* Administration and management 
7* Introduction to data processing 



Elective courses 

1. Advanced cataloguing 

2. Advanced classification (subject headings and Judaistic 

subject 8) 

3. Bibliography (children’s literature or oriental siWdies) 

4. History of books, writing and printing 

5. Book and paper preservation 

6. Comparative, librarianship 

7. Public libraries (incl. adult education, readers studies) 

8. Academic libraries 

9. Children’ 8 libraries 

10. Archives 

11. Preparation for printing 

12. Reprography 



70 


hrs. 


60 


hrs. 


20 


hrs. 


120 


hrs. 


20 


hrs. 


100 


hrs. 


40 


hrs. 


430 


hrs. 



40 


hrs. 


40 


hrs. 


10 


hrs. 


60 


hrs. 


10 


hrs. 


20 


hrs. 


70 


hrs. 


20 


hrs. 


30 


hrs. 


20 


hrs. 


20 


hrs. 


20 


hrs. 



360 hrs. 

the student must elect 180 hrs. 



Students with no practical library experience must carry out one month’s work at a 
recognized library, and after passing all the oral and written examinations are 
granted a diploma of ’'Qualified Librarian". 
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Appendix 3 



SYLLABUS OF THE GRADUATE COURSE IN INFORMATION SCIENCE 
OF THE CENTER OF SCIENTIFIC AND TECHNOLOGICAL INFORMATION 
In cooperation with the 
WEI2MANN INSTITUTE OF SCIENCE 



The syllabus comprises 560 Instruction (contact) hours, divided into 4 main subject 
areas | and a 6 weeks "Special Study" , carried out individually by each student under 
the supervision of a tutor selected by the programme director of the course. Home 
assignments are calculated on t;he basis of 2 hrs, for each contact hour. 



(i) General reference sources and basic search techniques - 33 hrs,; (ii) General 
sources of information in science and technology: books, periodicals, abstracting 
services, reports, patent specifications, standards, conference proceedings, trade 
catalogues, learned and professional societies - 40 hrs,; (iii) Subject-oriented 
sources of information: life sciences, earth sciences, exact sciences, engineering 
and technology - 70 hrs, 

2, Information Processing 152 hrs. 

(i) Cataloguing: types of catalogues, filing principles, union catalogues, card 
services - 22 hrs.; (ii) Library classification: Dewey, LC, Bibliographic Classi- 
fication, and a detailed study of UDC including index construction - 34 hrs,; (iii] 
Advanced classification and indexing: theory of classification, analysis of re- 
trieval systems, types of indexes - chain, hierarchial, etc., KWIC and KWOC 
Indexes, citation indexes, feature and edge-notched cards, descriptor languages, 
file organization and coding, chemical codes, search procedures - 44 hrs*; (iv) 
Oral and written communication; report and review writing, abstracting, transli- 
teration, preparation of bibliographies, proof-reading, editing, oral presentation 
(in Hebrew snd English) - 52 hrs. 

3, Use of Machines in Information Processing 145 hrs. 

(i) Reprography: contact and reproduction methods, equipment, materials, processes 
and costs - 18 hrs,; (ii) computer fundamentals: computer hardware, system analysit 
flowcharting, computer languages (emphasis on COBOL) - 75 hrs.; (iii) Computer ap- 
plications to library and information work in libraries for household purposes, 
construction of indexes, IR systems in use: MEDLARS, DDC, EURATOM, AEC, NASA, MARC 
CODATA, SDI, etc., costing - 52 hrs. 

4, information Management 104 hrs. 

(1) National and international information and library networks, institutions, 
cooperation - 8 hrs.; (ii) Introduction In statistics and survey techniques - 
36 hrs.; (Ill) Organization and methods - 14 hrs. ; < iv) Management of special 
libraries and information centers - 48 hrs. 

Miscellaneous 14 hrs. 

Introduction to course; cybernetics; techno! oglc.il i orecasting; visiting lecturers 
f Vom vurjous organizations; visits. 

Students passim', all written examinations and submitting a satisfactory "Special Study 

are granted a diploma. 



1, Sources of Information 



143 hrs 



4MW 
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Appendix 4 

SYLLABUS OF THE COURSE IN THE USE OF CHEMICAL INFORMATION SOURCES 
GIVEN TO GRADUATE STUDENTS AT THE WEIZMANN INSTITUTE OF SCIENCE, 1970 

/ 

1. Scientific Information: General definition and concepts of information, value of 
scientific information; methods of exchange of information and their efficiency; 
scientific literature as a source of information* 

2. Chemical literature: (i) Sources: encyclopedias, monographs, data collections, 
compendia, journals, dissertations, patents, industrial and government research 
reports, etc. (ii) Needs of the user: background information, current awareness, 
retrospective information. 

3. Tools: (i) Abstracting Services* (ii) Indexes: author, subject, classified, for- 
mula, patent, citation, keyword; their advantages and disadvantages, and their 
various uses* (iii) Chemical Nomenclature, (iv) Chemical Notation: purposes and 
uses, fragmentation codes, computer readable nomenclature (Dyson, Wiswesser, 
Hayward systems) , connectivity tables* 

4. Mechanized Information Systems: information store, search and retrieval tools; 
description of some operational mechanical information systems; the C*A*S. regis- 
try system: mode of operation and uses; commercially available tape services for 
search and retrieval, their mode of operation, and efficiency. 

5* Information Centers: Development of regional information centers, their tasks and 
services, construction and operation of small specialized information centers: 
purpose and use. Industrial information centers: personal files, modes of 
handling and operation* 




EDUCATION AND TRAINING OF LIBRARY PERSONNEL: THE INDIAN PROGRAMME 

Prof* P.N. Kaula 

University Librarian & Head of the 
Department of Library Science, 

Banaras Hindu University, Varanasi 



SYNOPSIS 

Traces briefly the history of education and training of 
library personnel in India which started with a training 
class in 1911, developing into a postgraduate Diploma 
course in 1937, and a Master’s Degree programme in 1948. 
This was followed by research facilities and special 
courses on f documentation. Describes the five levels of 
courses to produce semi-professionals, professionals, 
specialists, and scientists. 



1 HISTORICAL BACKGROUND 

1.1 Library Schools: Old Phase 

Library education in India started with the opening of a training class in the 
aarstwhile State of Baroda by W.A. Borden, an American Librarian and a student 
of Melvil Dewey in 1911. He had come to India to be the Director of the State 
Library Department in 1910. In 1915, another American Librarian Asa Don 
Dickinson who had been invited to organise the Punjab University Library, 

Lahore (now in Pakistan), started a library school at the University Library. 
Both the training programmes in Library Science by the two American Librarians 
and students of Melvil Dewey had a tremendous impact on library education in 
the country in the initial stages. This resulted in the introduction of 
Dewey Decimal Classification, Anglo-American Cataloguing Rules, and other 
American practices* The Punjab School continued till the Partition of India 
in 1947 and one of its earliest products who became the Librarian of the 
Imperial Library now called National Library at Calcutta, conducted a Diploma 
course from 1935-44 on similar lines. 

1.2 Library Schools: New Phase 

A new phase in library education started from Madras. This was due to 
Dr. S.R. Ranganathan, the then Librarian, Madras University and since 1965, 
National Research Professor in Library Science, who started the first systematic 
programme in Library Science education under the auspices of the Madras Library 
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Association in 1929. This course was taken over by the University of Madras in 
1931 which was converted into a full-time postgraduate Diploma course in 1937. 

The theory and practice evolved in India was taught on comparative basis with 
other well known practices in this school. The impact of this school in the 
country has been very significant. 

Another university that provided for Library Science training was Andhra Univer- 
sity which started a course in 1935. Banaras Hindu University established its 
Library School in 1941 and started the postgraduate Diploma course from 1942. 

This was followed by Bombay School in 1944, Calcutta School in 1946, Delhi School 
in 1947. After the Independence of the country, a number of library schools have 
been established in various States in India. Today there are library schools 
administered directly by universities or through their affiliated colleges in 
13 States. The States of Bihar, Jammu and Kashmir, Himachal Pradesh, Orissa, 
and Nagaland have no university library school of their own. An account of 
library schools had been given by the author in 19621 and 1967^. Carl Hints 
had also made his own evaluation in 19633. a chronological table of the library 
schools in universities in India is given in Appendix 1. The State-wise distri- 
bution of library schools is given in Appendix 2. 

2 LEVELS OF THE COURSES 

The levels of Library Science courses in India are as under:- 



S.No, 


Course 


Award 


Level 


1. 


Certificate in Library Science 


Cert Lib Sc 


Semi-Prof ess ional 


2. 


Bachelor of Library Science/ 


B Lib Sc/ 


Professional 




Diploma in Library Science 


Dip Lib Sc 




3. 


Master of Library Science 


M Lib Sc 


Specialist 


4. 


Special Courses 


ADRTC/ADR 


Specialist 


5. 


Research in Library Science 


PhD 


Research 



2.1 Certificate in Library Science Courses 

These courses arc mainly conducted by professional associations to train semi- 
professionals who hold lower positions in libraries or Join as school librarians. 
Some of the universities and colleges have been conducting this course of 3 to 
4 month's duration. A list of library schools at this level is given in 
Appendix 3, 



2.2 B Lib Sc Courses 

B Lib Sc course is conducted by universities to produce professionals. 

This course was earlier named postgraduate Dip Lib Sc (Diploma in Library 
Science). The change was brought about as a result of the recommendation of 
the Review Committee of the Univarsity Grants Commission^. This course is of 
one year's duration and is open for graduates only. 
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2.3 M Lib Sc Courses 

M Lib Sc course was first instituted in the country in 1948 at the University 
of Delhi. It is a two-year course - the first year being a first professional 
course which was then a Dip Lib Sc and now B Lib Sc. Banaras Hindu University 
started this course in 1965, This was followed by two other universities at 
Bombay (1967) and Chandigarh (1970). 

2. 4 Special Courses 

The Documentation Research and Training Centre, Bangalore instituted a course 
in Documentation of 16 month’s duration in 1962. It provides for post-course 
apprenticeship as well. The course has been recognised equivalent to M Lib Sc. 
The Indian National Scientific Documentation Centre, New Delhi, and the Indian 
Association of Special Libraries and Information Centres, Calcutta, are offering 
Diploma Courses in Documentation and Reprography, and Special Librarianship 
respectively, 

2.5 Other Courses 

The Women's Polytechnics set up in the country at various places are conducting 
Diploma in Library Science (Dip Lib Sc) courses of one to two years duration. 
These courses are open to undergraduates as well. At present the Polytechnics 
at Ambala, Bangalore, Chandigarh, Delhi, Jullundur and Rourkela are conducting 
these courses. 

2 . 6 Research 

Facilities for research in Library Science, leading to PhD are available in the 
University of Delhi, and DRTC, Bangalore. The minimum duration for earning a 
research degree is two years. 

3 METHOD OF ADMISSION 
3. 1 B Lib Sc Couflse 

The minimum educational qualifications prescribed by all universities for ad- 
mission to B Lib Sc is a basic bachelor's degree. However, in several library 
schools, the demand of candidates seeking admission is so heavy that graduates 
do not get admission in such schools. In that category are the schools at Delhi 

University and Banaraa Hindu University. In Banaras Hindu University, the 

applicants seeking admission for 1969-70 and 1970-71 have been 519 and 480 

respectively. It, therefore, conducts an Admission Test to select candidates. 

In case of working librarians and deputed candidates, higher academic degrees 
are not insisted. In several schools a specific percentage of seats is fixed 
for working librarians, Madras and Mysore schools insist on a pre-training 
apprenticeship of 4 months. Some schools do not attract talents and have also 
a large number of seats for admission. Some figures in this connection are as 
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under: 

These relate to 1969-70 and is based on a questionnaire. 



1 . 


Total number of students enrolled in 
20 universities 


1,000 


2. 


Highest enrollment in a Library 
School (Calcutta) 


103 


3. 


Lowest enrollment in a Library 
School (Lucknow) 


15 


4. 


Percentage of enrollment per school 


35 


5. 


Percentage of Women students 


53 


6. 


Average age of students 


26 


7. 


Admission by academic attainments: 






B.A. 


53.4% 




B.Sc. 


15.4% 



M.A. 30.00% 

M.Sc. 1.2% 

8. Knowledge of Foreign Language 3.00% 

3.2 M Lib Sc Course 

The minimum qualifications prescribed for admission to M Lib Sc course is the 
basic professional degree or postgraduate dlploaa (B Lib Sc/Dlp Lib Sc). A 
minimum of 50% marks in the basic professional degree is required at Banaras 
and Delhi Schools followed by an Admission Test. Delhi , however , does not 
screen its own students. Bombay School does not prescribe any minimum marks 
for admission but stipulates a minimum of three years experience in an approved 
library. 

DRTC course prescribes an intensive screening through a Test for a very limited 
number of seats. 

4 SCHEME OF PAPERS 
4.1 B Lib Sc 

The curriculum for B Lib Sc course is more or less in conformity with the scheme 
of papers recommended by the Review Committee of the University Grants Commission^ 
The core papers are the following: 

1. Library Organization 

2. Library Administration 

3. Physical Bibliography and Book Selection 
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4. Documentation, Bibliography and Reference Service 

5. Library Classification (Theory) 

6. Library Cataloguing (Theory) 

7. Library Classification (Practice) 

8. Library Cataloguing (Practice) 

9. Records of Practical Work 

Some schools offer additional subjects ns given below: 

1. Paper on General Knowledge by Bombay, Karnataka, 

Poona, Shlvajl and SNDT Schools* 

2. Paper on Survey of Books and Ideas by Osman la 
School* 

3. Paper on Library Development in India 
by Andhra School. 

4. A Project Report with 100 marks by Punjab 
School. 

5. An elective to be selected from (a) 

Children' 8 libraries; (b) High School, 

College and University Libraries; (c) Public 
Libraries; and (d) Special Libraries at 
Osmanla School. 

Provision of additional papers nay be justified but the duration of the course 
of one year does not provide sufficient time to go through the professional 
content of the courses. Moreover, General Knowledge Is implied In Reference 
Service if It Is covered properly* 

4.2 M Lib Sc 

The Syllabus of M Lib Sc course In Banaraa and Delhi Schools Is on the pattern 
suggested by the Review Committee of the University Cranta Commission 6 . The 
subjects prescribed are: 

1* Universe of Knowledge: Its Structure and Development* 

2, Depth Claaalf ication (Theory) 

3. Advanced Library Cataloguing (Theory) 

^ 4. Dspth Claaalf Ication (Practlcs) 

3. Advanced Library Cataloguing (Practlcs) 
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6. Any one of the following: 

(a) Public Library System 

(b) Academic Library System 

(c) Research and Technical Library System 

(d) Documentation 

7. Literature Survey in any one of the following: 

(a) Humanities 

(b) Natural Sciences 

(c) Social Sciences 

8. Current Problems in Librarianship as one of the 
following projects: 

(a) Field Studies 

(b) Survey Projects 

(c) Documentation Project 

(d) Literature Survey 



Bombay School, however, does not follow the UGC pattern. The scheme of papers 
cf this course is as under:- 

Group A: Written Papers. 



(a) Comparative Studies in Librarianship (two papers) 

(b) Research methods and Documentation Techniques 



Group B: Observation 

Group C: Oral and Viva Voce. 

Punjab School has adopted a modified pattern of written and practical papers. 
The scheme of papers of this course is as under: 



3. Documentation and Information Retrieval 

4. Advanced Classification and Cataloguing (Theory) 



5. Advanced Classification and , Cataloguing (Practical) 

6. Bibliography and Classics 



7» Library Organisation and Administration 
8. Dissertation 



1. Evolution and pattern of knowledge 



2. Advanced Reference Service 
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SYSTEM OF EVALUATION 

The examination for both the degrees are held at the end of the course* There is 
a written examination as well as practical* Banaras School has introduced the 
system of having examiners for practical papers like other disciplines in a 
university* Rajasthan School follows this practice for B Lib Sc course* Delhi 
and Chandigarh schools have introduced the semester system* Periodical class 
tests and internal assessments are also made* Terminal examinations are held 
for evaluating the student's performance during the course of study* In B Lib Sc 
course, professional tour of various libraries in the country is made for obser- 
vational study and evaluation of this report is added to the marks of the final 
examination* Some schools have a regular programme of tutorials, seminars and 
observational study. 

The gradation of the results is in the category of Class I, Class II and Class III 
All the schools prescribe a minimum of 60% marks to obtain a Class I, but there 
is variation in the percentage of marks to obtain a Class II which is generally 
a minimum of 50%* In Banaras, Burdwan, Calcutta and Gujarat Schools, Class III 
has been abolished* The minimum for a Class II in Burdwan School is 45%, 

Calcutta School 40% and in Gujarat School 45%* The minimum marks for a Class III 
varies between 30% and 35% in some schools. 

In M Lib Sc, the gradation is between Class I and Class II* Papers 7 and 8 
prescribed in Dinaras and Delhi Schools are in the form of dissertations which 
are evaluated like other d issertations in a university* Distinction in a sub- 
ject is awarded if the marks obtained are above 75%* 

THE FACULTY 

The librsry schools in India started with part-time teaching staff who were 
basically appointed librarians to work for their respective libraries* The 
status that teaching could give to a librarian, induced a number of them to 
start Library Science courses managed basically with part-time teachers drawn 
from the library* Aligarh School has been the first school to appoint full- 
time Lecturers in 1957* Of course, Dr* S*R* Ranganathan had been the full-time 
teacher at the Delhi School from 1947-55* The Review Committee on Library 
Science of the Univeraity Granta Commission (UGC) recommended full time teaching 
ataff in library achools* The author of this paper was appointed the first full- 
time Reader (Asaiatant Profeasor) at the Delhi School in 1958* The pattern of 
teaching ataff variea from one achool to another* While aome schools have 
appointed full-time faculty members, some are atill managed entirely by part- 
time teachers who are basically practising librarians* A large number of 
achoola have both part-time and full-time teachers* In some achools librarians 
from other: libraries and persona from other disciplines are aaaociated with the 
teaching aa viaiting teachera* 

The position of the library achools is also governed by the provlaion of teaching 
ataff* Wherever there are part-time teachera, the concerned library achool la 
managed by the librarian of that univeraity* 16 univeraity achoola and 4 poly- 
technic achoola are headed by full-time teachers* In r?at of the achoola, 
librarians are alao heada of the teaching departments* The achoola with full- 
time Profeaaora aa heads are at Banaraa and Madraa* DRTC haa Dr* Ranganathan 
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as Hon Professor with one more Professor as its head. An analysis of the 
teachers is given in Appendix 4. 

The analysis in the above appendix shows that there are 80 full-time teachers 
including full-time heads of departments. The total number of part-time teachers 
is 88. 

6. 1 Library Facilities 

Library facilities in most of the schools are available to teachers and the stu- 
dents. The Delhi School has a book, collection of about 6,000 volumes and re- 
ceives 112 periodicals on Library Science. It has a professional Librarian to 
manage i t. Banaras School has a collection of 5,000 volumes and it is subscrib- 
ing to 58 periodicals on Library Science. At the lower level, there are newly 
established Library schools which have practically no library on Library Science 
of their own. Even in matters of text books and the number of copies of basic 
books, they depend on the assistance of neighbouring libraries. Rajasthan 
School has built up a Workshop for the use of students. 

7 EVALUATION OF THE COURSES 

7. 1 Number of Schools 

The number of library schools offering B Lib Sc is fairly adequate. Out of 83 
universities functioning in India, 34 universities have departments of Library 
Science, Four universities offering M Lib Sc is not, however, adequate, ’With 
the growth in the number of professionals, India will need 164,263 M Lib Sc and 
B Lib Sc holders at the end of Sixth Five Year Plan. The estimates of man- 
power requirements have been worked out in the following seven categories by a 
panel at DRTC, Bangalore and given in a table forming Appendix 5. 



S.No. 


Category 

(Class) 


Professional 

required 


Minimum Qualifications 


1. 


Class 


A 


6 


PhD or M Lib Sc & Master's 
decree in some other subject 


2. 


Class 


B 


7 


tt 


3. 


Class 


C 


55 


tt 


4. 


Class 


D 


656 


M Lib Sc or B Lib Sc and a 
Master's degree In some other 
subject 


5. 


Class 


E 


3,593 


tt 


6. 


Class 


F 


159,946 


B Lib Sc (Second Class) 


7. 


Class 


G 


79,689 


Certificate in Library 
Science 



( The annual output from library schools is about 1,000 for B Lib Sc and about 

| 50 for M Lib Sc. With the accelerated Library development programmes In the 

country, the product from schools have to be Increased, 
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7.2 Contents of the Courses 

Library schools generally offer a one set of B Lib Sc course which is endorsed 
by the University Grants Commission. As is evident, the scheme of papers do 
not take into consideration the developments taking place elsewhere and also 
within the country. They do not meet the new functions of the libraries ade- 
quately. The emphasis in the scheme of papers should be in developing adequate 
and up to date library services in institutions. In M Lib Sc, there should be 
more elective papers so that students turn out specialists in one area of 
Library Science or other. The needs of special libraries, university libraries 
and documentation centres requiring specialised services should be covered at 
this level. Special courses on school llbrarlanshlp and special llbrarlanshlp 
should be introduced. M Doc (Master in Documentation) courses be instituted 
along with M Lib Sc. 

7.3 Standardization of Courses 

There should be an accrediting agency for degree courses. The Indian Library 
Association being inactive, the initiative has to be taken by the Indian Associa- 
tion of Teachers of Library Science (IATLIS) established in December 1969. It 
held it 8 annual seminar on Teaching Methods in Library Science in December 1970. 
The growth of Library Schools should be checked and an evaluation of the teaching 
and other facilities in existing schools made so that higher teaching standards 
are maintained. The Association (IATLIS) is conducting a survey of library 
schools and plans to bring out a "Library Science Year Book" from 1972* Similarly 
an accrediting agency is required for Cert Lib Sc courses. No evaluation of 
these courses has been made so far. The Review Committee of UGC was concerned 
with the evaluation of professional courses conducted by the universities. It 
has, however, left the teaching at the semi-professional level to library asso- 
ciations who by themselves are unable to reach a particular standard. While 
there is a certificate course of one year's duration conducted by the Delhi 
Library Association from 1955, there are similar courses of 3 to 4 months duration. 
Papers offered also range from three to eight in these courses. Some colleges 
have also started such courses, though no recognition has been accorded to them 
by the universities with which the colleges are affiliated. 

There is a brain drain in library science as well. Several librarians who qualify 
even with M Lib Sc degree migrate to the United States to join MLS programme 
once again. There is therefore need to examine the equivalence of M Lib Sc with 
MLS degrees offered by American and Canadian universities. M Lib Sc in India is 
a two-year full-time professional degree programme while MLS in U.S.A. is not so. 
Perhaps a Committee of Association of American Library Schools and the Indian 
Association of Teachers of Library Science can resolve this issue. Recently 
British Library Association has taken up the issue of equivalence of FLA and 
MLS with the Association of Canadian Library Schools. 

7.4 Duration of Courses 

The duration of a year to both the professional graduate degree courses and post- 
graduate degree course is not sufficient. In the M Lib Sc course! especially 
the two projecta covering intenalve knowledge-content and research make it 
difficult for students to read extensively for specialisation within the limited 
period. The students do not find sufficient time to do practical experimental 
and project work; and at the same time, study for the number of papera prescribed 
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in one year* The duration of both the degree courses be Increased to three 
years Instead of two years as at present* The project work may be completed 
within the next academic year after the evaluation of theory and practical 
papers* Dr* Ranganathan has agreed on this Issue?. For admission to H Lib Sc, 
a minimum three years experience In library work should be prescribed so that 
the students are familiar with the application of theoretical knowledge learnt 
during B Lib Sc period. 

8 RESEARCH FACILITIES 

The Library schools even at the advanced level, have little facilities for doing 
research. The Delhi school has not attracted sufficient candidates for Its 
doctoral programme. DRTC Is the only school In the country where research In 
various branches of Library Science and Documentation Is being conducted by Its 
faculty members and Is able to attract candidates for doctoral programme. There 
are very few teachers In other library schools who engage themselves In research 
and publish papers or books. Some teachers In the Banaras, Delhi, Gauhatl, 
Punjab, and Rajasthan schools have produced works on library science. Facilities 
to do research for development of the subject by teachers In library schools 
should be adequately provided by the university authorities and the University 
Grants Commission. Sufficient fellowships and scholarships should be made 
available both to the students and teachers In library schools* The seminar 
on the Teaching of Library Science In 1966, recommended that library schools 
actively take to research on library problems, faced by librarians In their day 
to day operations^. 

Jesse H. Shera strongly pleads for research programme In library schools. He 
says "Librarians know very well how to do what they do, but they never concern 
themselves to any great extent with why they do it. "9 Library education In 
India must be research-oriented at all levels. 

9 TEACHING METHODS 

Library schools conduct training programme mainly through lecture method. In 
some schools, other methods of teaching like seminars, tutorials, colloqula, 
project work, and observational study are emphasised. All the schools should do 
minimum of lecturing and Impart Instruction through other effective methods of 
teaching. The teachers of Library Science should also undergo a training pro- 
gramme on teaching methods***. Teachers should have worked In some good library 
before they become teachers* They should engage themselves constantly with the 
practical problems in a library and demonstrate them to the students. The 
University Library should be used as a laboratory of library science by teachers 
and students. Teaching should be Integrated with theory and practice of Library 
science* 

10 RECOMMENDATIONS AND ATTAINMENT 

Library education In India has been a recurring theme of debate and discussion 
among the members of the profession* Four All-India seminars have been held 
on the subject during the last decade* Two of these seminars were held In 
Delhi (1960 and 1966) **, one at Bombay (1965) and one at Banaras (1966)12, Li- 
brary education has also been discussed in other professional conferences* A 
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survey of the Delhi School has been made by Morris A. Gelfand^^, A survey of 
library education has been made at Banaras School^. Karl Hintz and the author 
have made evaluation of the training programmes in 1962, 1963 and 1967, Review 
Committee of UGC (with Dr. Ranganathan as Chairman and the author as one of the 
expert members) set up in 1961, has made a number of recommendations. Another 
Committee for M Lib Sc courses has been appointed by UGC last year. The recom- 
mendations of the seminars and of UGC Committees have gone a long way in 
developing library education in India on proper lines. 

India has the unique distinction of starting M Lib Sc Programme in 1948 when 
no other university in the entire Commonwealth had provision for it. This 
position of India continued till 1965. India has also established an endowed 
Chair in Library Science at the Madras School in 1958 which is perhaps the only 
endowed professorship in the East. India has also appointed Dr. S.R. Ranganathan 
as 'National Research Professor in Library Science' , thus recognising library 
science as a discipline at par with other academic disciplines in the country. 
This recognition according to Mr. P.N. Kirpal, Chairman, Executive Board of 
Unesco, is "unique in the world"15. By forming the Indian Association of 
Teachers of Library Science, India has achieved another distinction in Library 
Science by instituting a Fellowship for study in library education at DRTC, 
Bangalore. This has been made possible due to the donation received from a 
distinguished teacher from the United States. It is proposed to have exchange 
of teachers with other countries to give new shape to library education. 
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APPENDIX 1 

CHRONOLOGICAL TABLE OF THE LIBRARY SCHOOLS IN 
UNIVERSITIES IN INDIA 



Place of 
School 


Name of the 

University 

School 


Year of 
Foundation 


Name of 
Course 


Madras 


Madras 


1931 


Cert Lib Sc 


Waltair 


Andhra 


1935 


Dip Lib Sc 


Madras 


Madras 


1937 


B Lib Sc 


Varanasi 


Banaras 


1941 


Cert Lib Sc 


Varanasi 


Banaras 


1942 


B Lib Sc 


Bombay 


Bombay 


1944 


B Lib Sc 


Calcutta 


Calcutta 


1946 


Dip Lib Sc 


Delhi 


Delhi 


1947 


B Lib Sc 


Aligarh 


Aligarh 


1951 


M Lib Sc 
Ph D 

Cert Lib Sc 


Baroda 


Baroda 


1956 


B Lib Sc 


Nagpur 


Nagpur 


1956 


B Lib Sc 


Poona 


Poona 


1956 


B Lib Sc 


Ujjain 


Vikram 


1957 


B Lib Sc 


Aligarh 


Muslim 


1958 


B Lib Sc 


Hyderabad 


Osman i a 


1959 


B Lib Sc 


Jaipur 


Rajasthan 


1959 


B Lib Sc 


Chandigarh 


Punj ab 


1960 


B Lib Sc 


Trivandrum 


Kerala 


1961 


B Lib Sc 


Jaipur 


Rajasthan 


1961 


B Lib Sc 


Pachmarhi 


Saugar 


1962 


B Lib Sc 


Lucknow 


Lucknow 


1962 


B Lib Sc 


Dharvar 


Kamatak 


1963 


B Lib Sc 


Jadavpur 


Jadavpur 


1963 


B Lib Sc 


Bombay 


SNDT 


1963 


B Lib Sc 


Ahmed ab ad 


Gujarat 


1963 


B Lib Sc 


Gwalior 


Jiwaj i 


1964 


B Lib Sc 


Kolhapur 


Shivaji 


1964 


B Lib Sc 


Burdwan 


Burdwan 


1964 


Dip Lib Sc 


Varanasi 


Banaras 


1965 


M Lib Sc 


Mysore 


Mysore 


1965 


B Lib Sc 


Reva 


Rewa 


1965 


B Lib Sc 


Gsuhati 


Gsuhati 


1966 


B Lib Sc 


Varanasi 


Varanaaeya 

Sanskrit 


1967 


Dip Lib Sc 


Bombay 


Bombay 


1968 


M Lib Sc 


Jabalpur 


Jabalpur 


1968 


B Lib Sc 


Kurukshetra 


Kurukshetra 


1969 


B Lib Sc 


Patiala 


Punjabi 


1969 


B Lib Sc 


Bhopal 


Bhopal 


1970 


B Lib Sc 


Chandigarh 


Punj ab 


1970 


M Lib Sc 


Ssugar 


Ssugar 


— -19J* 


B Lib Sc 
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APPENDIX 2 

STATE-WISE LIBRARY SCHOOLS 



ERJC 



S.N. 


State 


University 

Schools 


Other 

Schools 


Total Number 
of Schools 


1. 


Andhra Pradesh 


STATES 

2 


1 


3 


2. 


Assam 


1 


- 


1 


3. 


Bihar 


- 


1 


1 


4. 


Gujarat 


2 


1 


3 


5. 


Haryana 


1 


1 


2 


6. 


Himachal Pradesh 


- 


1 


1 


7. 


Kerala 


1 


1 


2 


8. 


Maharashtra 


5 


2 


7 


9. 


Madhya Pradesh 


7 


1 


8 


10. 


Mysore 


2 


2 


4 


11. 


Orissa 


- 


1 


1 


12. 


Punjab 


1 


1 


2 


13. 


Raj as than 


1 


- 


1 


14. 


Tamil Nadu 


1 


1 


2 


15. 


Uttar Pradesh 


4 


4 


8 


16. 


West Bengal 


4 


1 


5 


17. 


Chandigarh 


UNION TERRITORIES 
1 


1 


2 


18. 


Delhi 


1 


3 


4 






Total 34 


23 


57 



Note ; Other schools Include DRTC, and schools at INSDOC, 
Polytechnics | associations and colleges. 
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APPENDIX 3 



COURSES FOR SEMI-PROFESSIONALS 



Association Schools . 



I# Andhra Pradesh Library Association 
2# Bengal Library Association 
3# Bihar Rajya Pustakalaya Sangh 

4, Bombay Library Association 

5, Delhi Library Association 

6, Gujarat State Library Association 

7# Indore Divisional Library Association 

8. Kerala Librarians Association 

9, Maharashtra Library Association 
10* UP Library Association (3 Centres) 



11. Rajasthan University 

12. Madras University 

Government Schools 

13. Library Training School, Bangalore 

14. Library Training School, Gwalior 

15* Library Training School, Simla 

16* Library Training School, Madras 

17# Library Training School, Patna 



18# Agra College, Agra 

19# BR College, Agra 

20# Dharam Samaj College, Aligarh 



Universities 



Colleges 
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APPENDIX 4 





AN ANALYSIS OF THE FULL-' 


TIME AND PART-TIME TEACHERS 






IN THE LIBRARY 


SCHOOLS IN 


INDIA 


School 




Status of 


Full-tine 


Part-time 


Total 






the Head 












UNIVERSITY SCHOOLS 






Aligarh 




Reader 


4 


- 


4 


Andhra 




Reader 


2 


4 


6 


Banaras 




Professor 


4 


4 


8 


Baroda 




Reader 


- 


5 


5 


Bhopal 




Lecturer 


1 


1 


2 


Bombay 




Professor 


- 


4 


4 


Burdwan 




Lecturer 


- 


4 


4 


Calcutta 




Lecturer 


2 


9 


11 


Delhi 




Professor 


6 


3 


9 


Gauhati 




Reader 


2 


1 


3 


Gujarat 




Professor 


1 


2 


3 


Jabalpur 




Lecturer 


2 


- 


2 


Jadavpur 




Reader 


2 


2 


4 


Jlvaj 1 




Lecturer 


2 


- 


2 


Kamatak 




Reader 


3 


6 


9 


Kerala 




Professor 


3 


2 


5 


Kurukshetra 


Reader 


2 


3 


5 


Lucknow 




Lecturer 


2 


1 


3 


Madras 




Professor 


4 


1 


5 


Mysore 




Reader 


1 


3 


4 


Nagpur 




Professor 


- 


5 


5 


Osman Is 




Lecturer 


2 


2 


4 


Poona 




Professor 


2 


1 


3 


Punjab 




Professor 


1 


3 


4 


Punjabi 




Reader 


2 


1 


3 


Rajasthan 




Reader 


5 


- 


5 


Reva 




Lecturer 


2 


1 


3 


Saugar 




Reader 


- 


3 


3 


Shlvajl 




Reader 


1 


2 


3 


SNDT 




Reader 


1 


4 


5 


Varanaseya 


Sanskrit 


Lecturer 


- 


3 


3 


Vlkram 




Reader 


5 


2 


7 






INSTITUTIONAL SCHOOLS 






DRTC 




Professor 


5 


_ 


5 


INSDOC 




Reader 


1 


4 


5 






WOMEN'S POLYTECHNIC 






Aabala 




Lecturer 


2 




2 


Bangalore 




Lecturer 


3 


- 


3 


Chandigarh 




Lecturer 


2 


- 


2 


Delhi 




Lecturer 


2 


1 


3 


Jullunder 




Reader 


2 


1 


3 


lourkela 




Lecturer 


2 


- 


2 
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PUBLIC LIBRARY SYSTEM 

I* Libra chine 

2. Branch Library 

3. Clcy Central Library 
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C 

D 
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4. Rural Central Library 

5* State Central Library 

Total 

6. National Central Library 

Total 

ACADEMIC LIBRARY SYSTEM 

1. Universities and 

Similar Bodies 
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Special Library and 
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MtOrtSSIONAL DBVELOPMNT IN LIMANY-INPORMATION SCIENCE AT 

tn cm imivnsirr or new yore 

Vivian S. Seaaiona 

Aaoocleto Professor, University Graduate Division 
The City. Onivaraltr of Haw York 



CUNY' a rrofaaalonal Development Program la doaalr linked -*■ in 
con ca^t and effectuation ~ to tha looger-range goal of a Ph.D. 
In Library- In format loo Sdanca. Tha over-all frostwork la pro- 
vldad by fonr groupings of courses, of which tha moat general 
containa both hunanlotlc (or qualitative) and quantitative con- 
cepts Tha three arena available for aore focuaaed study ara 
Information accaaa, lto naaagameot, and lta technology* 



Introduction 

Tha lataraat In rrofaaalonal Development for Llbrarlana that prompted tha eetab- 
llshnant of a Canter for tha Advancement of Library-Information Science at Tha 
City Onivaraltr of New York la by no naans United to that lnatltmtlon. Tha 1970 
work by Daatoa, Datmaan M.L.S. and Ph.D.. (American Library Aaeocletlon) la tha 
moat important currant otudy In thin area. It reveale that aavaral library 
echeola ara alao concerned about non- doctoral education beyond tha Heatar of 
Library Sdanca or Naeter of Sdanca la Library Sdanca. The ALA etudy alao re- 
veala a lack of coneleteat pattern, If Indeed a pattern can be dlacaroed. 

One evident fact la tha aalatanca of none roletloaahlp batman poat-naatara 
profaeelomal development and more formal doctoratoo. Under these drcunataacaa. 

It la Internet Ins to note that tha program under dlacuaalon haa bean undertaken 
by tha Ualverelty Graduate Dlvlalon ltaalf, tha admlnlatrator of tha Vh.D. programa 
throughout Tha City Ualverelty (CUNY). Baaldaa, no other kind of academic unit 
can have a greater lataraat In tha Improveme n t of thoee' profaaalonal Information 
aarvlcoo that oupport graduate raaaarch. 



Library- Information Sdanca at CUNY 

What Daatoa did not — and could not — include vaa any rafaraaca to tha Canter for 
tha Advancement of Library-Information Sdanca at CUNY. Tha raeaon la quite 



SYNOPSIS 



aiaplat tha Canter wee being formulated, and lto programa aubjected to their 
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first livn tents, at the vary t iir^? that the ALA work wtt being prepared for 
publication. Furthermore CUNY'* Center had, from it» first inception ns an idea, 
closer ttofl to the Special Libraries Association (SLA) and the American Society 
for Information Science (ASIS). It Is therefore interesting that the program 
now In isplexcntation anticipated — hopefully successfully — so many of the 
problem* raised by the ALA study. 

It is no coincidence that the program under discussion incorporates o great deal 
of the thinking that surrounded the Curriculum Committee Workshop of the ASIS 
Special Interest Croup in Education. It is also no coincidence that it draws 
upon several years of workshop experience by the Documentation Croup of the 
New York Chapter of SLA* The Director of the CUNY Center was very much involved 
in both sets of activities* The program also draws heavily upon the experience 
of a predecessor at the University Graduate Division, Project URBANDOC, an R and D 
effort In information storage and retrieval that was funded by the U.S. Depart- 
ment of Housing and Urban Development* 

The background of the Center virtually dictated that its Professional Development 
represent an integration of librarlanship with the computer-inspired developments 
that have transformed the older term "documentation" into "information science". 
Thus "Library-Information Science". The earlier project experience of the princi- 
pal staff (Project Director end Systems Analyst) also indicated that there would 
be an emphasis on practicality, on the ability to input and output information* 
However, the sponsorship of the Graduate Division also guaranteed the presence 
of a strong component of scholarship even though formal academic credits are not 
presently involved. 

Relationship to Doctoral Study 

Although the Professional Development Program under discussion was established 
for the express purpose of helping practicing librarians and other information 
personnel become better acquainted with newer developments in their field, all 
the courses relate to another responsibility of the Center for the Advancement 
of Llbiary-Informatlon Science. That is to develop full-fledged Ph.D. studies 
in this field. The kinds of work now being given are therefore not necessarily 
terminal; they may also satisfy some of the requirements for the Ph.D. The only 
concession to the present somewhat Informal status of the programs is that the 
Instructors suggest, but do not require, outside readings, homework, and 
laboratory experience. 

It is anticipated that when the University does authorize the granting of academic 
credit in Library- Information Science — hopefully in another year — it will 
continue to enable non-candldatea to participate appropriately. There are also 
discussions underway which would widen the non-doctoral students' options to 
the point where he or she could elect to obtain a second masters degree (or a 
first, if entering without one). If necessary, the Center is prepared to design 
a minimum number of dual offerings — one set for formal credit and the other 
with lesser demands on tha students. That, however, will not be done unless the 
necessity for duality la proven both pedegogically end financially* 
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Courses: Design and Implementation 

There are presently four areas of Instruction in the over-all design. The first 
is (( general (( in that It contains those courses, seminars and colloqula that are 
considered to be more applicable to the entire program than to any one area of 
concentration. Equally at home in the general category are such humanistically- 
oriented courses as Information in Society and such quantitatively-oriented ones 
as Mathematical Concepts. The latter course was the first of this group to be 
Implemented. Successful completion of basic work in set theory, probability and 
statistics is considered as essential for any serious work In library-information 
science. It means, for one thing, the ability to read the Journal of the ASIS 
In It 8 entirety. 

The three other areas represent three foci of concern: information access, 

management, and technology. In a formal degree-granting program, the core curri- 
culum would Include at least the Introductory work in each; a similar spread Is 
recommended for non-credit students. 

1. Information Access 

The offerings can be expected to be heaviest in information access, which in this 
program means chiefly bibliographic access and control. During the Initial year 
of Professional Development, there were three courses in this area: 

Document Analysis, Document Analysis Workshop, and Introduction to Information 
Retrieval. The first was strictly a "talk" course, while the other two Included 
Interaction with operational information systems. The students in the Document 
Analysis Workshop prepared a Thesaurus, and then used the Thesaurus to Index a 
selection of NY Times articles. Their input was processed for them, and they 
received as output two sets of indexes. One used the programs that had been 
developed by Project URBANDOC and the other was processed by the American 
Petroleum Institute. 

The Information Retrieval class was divided between introductory lectures, the 
opportunity to address class-formulated queries to four large-scale 1R systems 
(two of them SDI oriented, and two of them offering retrospective searches), and 
the opportunity to prepare and have processed within the University Graduate 
Center a small data base of its own. The class also queried its own data base, 
comparing the effects of searching on descriptors with the effects of searching 
on author, title, and abstracts. 

The Library-Information Science program Is particularly fortunate In having ac- 
quired a programming package for the IBM 1130 that will permit the students to 
simulate with small data bases most of the search capabilities of larger computer 
installations. The resident staff of the Center expects to devote a considerable 
effort In the coming year to furthering the teaching capabilities of this system. 
More advanced students will be expected to gain greater interaction 
with the system itself, whether or not they are candidates for degrees. The 
capabilities of the system also include substantive data retrieval, one of the 
many additional units to be added to the information access area. 



511 




VI Selection, Education and Training of Pereonnel 

2. Management 

In thla area, the Professional Development Program started with Management 
Science Concepts. It soon developed, however, that thla course could advanta- 
geously concentrate on the Operations Research alone, which might remove It to 
the more general group of courses. In any case, the next two coureee scheduled 
for Implementation are Financial Administration and the Information Industry. 
Plans for the atlll more distant future Include a variety of other management- 
oriented studies In Library-Information Science. The large reservoir of busi- 
ness administration expertise at the Bernard M. Baruch College of the University 
offers unlimited educational opportunities In this area, with many administrative 
possibilities for effectuating the opportunities. 

3. Technology 

The third and last area of focus Is the obvious ons of technology. The 
Professional Development Program started with a course In Computer Concepts and 
Programming. It utilized the PDP-8 computer and terminals, thus starting 
students early with hands-on experience. Although a small and relatlvsly 
Inexpensive system geared primarily to quantitative analysis, Its teaching 
capabilities for Library- In format Ion Science are by no means that limited. 

Both the "hardware" and the "software" aspects of the technology will be de- 
veloped further, with the IBM 1130 being used for on-slte processing as well 
as remote access to a large IBM 360. However, the prime technological thrust 
of Library- Information Science la not to develop computer scientists. If 
specific needs for advanced study In that area should arise, appropriate doc- 
toral programs at the University will be called upon to furnish assistance. 

People: Students, Faculty, Advisors 

1. Students 

The appeal of the Professional Development Program In 1970-71 was entlrsly to 
"working stiffs" In libraries and Information centers. Their employers, many 
of whom paid the tuition, range from the army to the university, from drug 
companies to advertising agencies, from the professional society to Hall Strsst. 
A total of 89 Individual studenta (not counting librarians from the City Univer- 
sity Graduate Center), participated to the extent of 135 course registrations. 
Although all classes were scheduled for 14 weekly meetings each, from 6:30 to 
8:30 In the evening, a great many of the students travelled from the far reaches 
of the metropolitan area. 

A general characteristic of the student body Is that its active Involvement In 
professional activity Is by no means limited to this particular program. Prac- 
tically all the students ars members of professional societies, many of them 
belonging to several. Those societies were. In fact, the chief source of re- 
cruits; the University was reluctant In this Initial phase of ths program to uss 
newspaper or other general advertising which might attract people who did not 
already have a considerable cosmlttment to library- In format Ion science. 
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The progran does, however, exclude non-Joiners; they report being alerted to the 
existence of the program by announcements in such places as Library Journal . 

2* Faculty 

Staffing the program was, of course, crucial. The two resident members of 
the Center for the Advancement of Library- Information Science were the Director 
and the Systems Analyst, the first a professional librarian and the second an 
IBM-trained systems analyst. With over five years of working together on Project 
URBANDOC, they had Integrated library and Information science at the working 
level long before the time to formulate the relationship as a theoretical propo- 
sition. There was also some Individual and joint experience at the workshop 
level with the kinds of librarians who later enrolled In the Professional 
Development Program. 

The basic teaching team was supplemented by laboratory assistance from 
the Graduate Center Computer Facility and by teaching assistance from Baruch 
College, which houses the City University's doctoral program In business admini- 
stration. 

In addition, the 1970-71 program called upon two leading practitioners In the 
field; Everett Brenner of the American Petroleum Institute and Melvin Welnstock 
of the Institute for Scientific Information. 

There are, of course, a great many other sources of qualified Instructors In 
the New York area, with several library schools. Information systems, and Im- 
portant llbrarlea located In or near the city. The present thinking, however, 
is to turn more to the various faculties of the City University, and particu- 
larly the doctoral faculty which has Ita headquarters also- at the Graduate 
Center. The present emphasis at the University on Interdisciplinary research 
la aapaclally relevant to library- Information science, which can profitably 
Incorporate expartlae from both the "hard" and "soft" sciences. 

3, Advlaora 

The potential broad academic baae of library-information aclence at CUNY la 
already rapreaanted by the membership of the Committee for the Advancement 
of Library- Information Science: 

Profeaaor James M. Bashers, doctoral program In sociology; 

Frofeaaor Barry S. Brook, Executiva Officer, doctoral program In mualc; 
Profeaaor Carl Halm, doctoral program In educational paychology and Director, 
Graduate Center Computer Facility; 

Dr. Richard Logsdon, C.U.N.Y. University Dean of Llbrarlea; 

Frofaaaor Margaret K. Rowell, Librarian-in-Charge, Graduate Center Library; 
Frofeaaor Vivian S« Seaaiona, Director, Center for the Advancement of Library- 
Information Sclanca; 

Mrs. Lynda W. Sloan, lacturar, Canter for the Advancement of Library- 
Information Sclanca; 

Profaaaor Lawrence R. Zeitlln, doctoral programs In buslnaas and paychology. 
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The formal Identifications do not, however, properly indicate the extent of 
individual expertise in library-information science. Professor Brook initiated 
and directs RILM (Repertoire International de Literature Muslcale) . 

Professor Beshers is an authority on the use of Census tapes, especially for 
demographic analysis. Professor Helm has extensive experience in computer 
technology, especially for educational applications. Professor Zeltlln is 
similarly experienced in management information systems as well as in the 
relationship between information and decision-making process. Dean Logsdon 
and Professor Rowell are particularly experienced in the areas of academic 
and research information needs and resources. 

The Future: Prospects and Caveats 

This is one situation in which it is completely accurate to say that a program 
enjoys the whole-hearted support of the administration. Dr. Mina Rees, 

President of the University Graduate Division, and Dr. Harold Proshansky, the 
Dean, not only support professional development in library-information science, 
they made it possible. Having made available the necessary Graduate Division 
personnel lines to establish the Center for the Advancement of Library- 
Information Science, they asked only that the Incremental cost of its external 
teaching responsibilities be borne by student fees* As Indeed it was during 
the first year, and is anticipated for the next. This degree of committment 
contrasts sharply with the Impression of Danton in the ALA study that most 
post-M.L.S. programs are closely linked to the availability of federal funds. 

The immediate prospects for continued and expanded professional development 
at CUNY are therefore excellent. As for the longer range one of the Ph.D., It 
is being pursued with all the speed that is possible under the administrative 
constraints of a large public institution. 

In the meantime, some words of caution are necessary to temper the predominately 
optimistic tone of this report. They have to do with the "science" part of 
library-information science. The thrust of the entire program appears more in 
line with the methods of the "hard" sciences than of the humanities that have 
hitherto dominated the library part of library-information science. Whole 
semesters of rigorous courses, seminars and laboratories are undoubtedly appro- 
priate to the doctorate, and even to the masters' degree. However, it is 
possible that non-credit students will be less inclined to push as consistently 
for proper mastery of their materials. The question then arises as to how the 
University should treat their requests for some formal recognition of attend- 
ance. 

Another set of caveats concerns the proper teaching of the courses. Although 
those that received specific mention above are reasonably straight forward in 
their staffing requirements, others will require more than one Instructor, 
and/or more than the standard timing. The full implementation of Professional 
Development in Library- Informat ion Science at The City University may, there- 
fore, require some decisions that other institutions are not prepared to take. 
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A SYSTEM APPROACH TO EDUCATION AND TRAINING 
IN INFORMATION SCIENCE AND TECHNOLOGY 



Herbert Schur 
Senior Lecturer, 

Postgraduate School of Llbrarlenehlp and Intonation Science, 
University of Sheffield 



SYNOPSIS 

In the systems approach the detailed dealgn and implementation 
of a prog reuse foil ova conaideratlon and dlacuaalon in depth 
of utilitarian and educational objectives and the assignment 
of prlorltlea, of environmental and other conatralnta, and of 
qualitative and quantitative criteria for evaluating the effi- 
ciency and effectlveneaa of the education and training pro- 
gramme. Theae and other factors are dlacuaaed. 



Introduction 

Tha ayatama approach! i* ■ method of problem eolvlng which, viewing a ayatam aa com- 
prlaing interacting and intardepandant aub-ayatama of man, machinaa, mat aria la, and 
money and itself a sub- ay stem of a wider ayatam, alma at achieving tha overall objec- 
tlvaa of tha ayatam in tha moat affective and affldant way. Tha mathod includea 
flrat and foremoat tha conaldaratlon of the objactlvaa of the ayatam and their re- 
examination in tha wider context, the ciimaldaratlon of conatralnta, and of criteria. 

It la tha lmpoalng of tha requirement for thinking in terms of tha wider context that 
dlatlngulahea thla approach from tha all too common narrow and parochial approach tha 
conaaquancaa of which may ba vary aarloua. To find a few examplaa of a failure to 
apply a ayatama approach in much of induetrlal planning we need to go no further that 
the polluted beachea, to breathe tha polluted air, and to aaa tha derallct land in 
almoat any of tha advanced countrlaa. A failure to apply tha ayatama approach in 
programs design may result in equally aarloua conaaquancaa, in terms of ala-employment, 
und ar- employment and unemployment. 

Assuming, then, that the ayatama approach may be of value in educational planning, we 
atlll have to aak whether education and training ia required in information acienca 
and technology. The fact that moat of tha information worker.* and raaaarchara do not 
poar^aaa formal educational qualifications in information aclance or technology and that 
many of them are auccaaeful may euggaat that formal education and training in thla 
flald la not nacaaaary and that moat, or even all, that tha information worker or re- 
aearchar naeda to know can be learnt 'on the Job 9 * An analogous situation ariaaa in 
management education and tha caas for an academic educational programme in that flald, 
mada by Frofaaaor Ellon 2 , appllaa with equal force hara* Ha atatedi "Let ua faca iti 
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some people go through an engineering college and never become good engineers; others 
become first class engineers without any formal education. Similarly, I do not think 
that the question whether management education can produce good managers or not is a 
relevant one. The question is whether it can produce better managers* Naturally you 
must have a flair for management and certain personal attributes, as indeed for any 
subject or profession you choose as a career, and without appropriate pre-requisites 
it is doubtful whether any educational programme can guarantee to produce good manar 
gers, good engineers, good doctors or indeed good practitioners in any field of human 
endeavour. But given the aptitude and inclination for a certain subject, surely the 
individual stands to benefit from a course which provides a broad background, which 
attempts to knit together relevant problems and issues and which relates past and 
present human experience in the field, rather than to struggle on his own and learn 
from his slowly accumulating narrow experience 

Objectives 

The primary objectives in education and training programmes are educational. It is 
only when we consider the programmes in the wider context that we have to consider 
the problem of education and training for what and the implications of these needs 
for the programmes. 

Following Bloom^, educational objectives may be grouped into three domains: 

(1) cognitive, 

(2) affective, and 

(3) psychoaotor. 

The cognitive domain, which will be considered in more detail below, includes those 
objectives which are concerned with the recall or recognition of knowledge and with 
the development of intellectual abilities and skills. 

The affective domain is concerned with attitudes and values. 

The psychomotor domain relates to the manipulative and motor-skill area and is there- 
fore largely of interest in the design of training programmes at the technician level. 

Returning now to considering the cognitive domain, we have, in order of increasing 
. intellectual difficulty, six categories: 

(1) knowledge (which includes the recall of facts and gene rallsat ions, 

methods and processes, patterns and structures), 

(2) comprehension (which includes the ability to make use of the material or 

idea being communicated without, necessarily, relating it to other 
material or seeing its fullest implications; it includes the ability 
to translate, interpret and extrapolate from facts or ideas), 

(3) application (which includes the use of abstractions in particular or 

concrete situations), 

(4) analysis, 

(5) synthesis, and 

(6) evaluation (including judgements in terms of internal or external evidence). 

In analysing existing programmes and courses within these, it is difficult to detect 
just what stress is laid on the various categories (for instance by studying the 
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examination papers) since solving a problem encountered for the first time may require 
the highest intellectual ability, yet if the problem and its solution had been encoun- 
tered earlier, only simple recall may be involved. The need for educating and training 
students for vork in a dynamic situation, rather than a static one, means that a right 
balance must be found between vhat Henderson^ called "knowledge- stuffed memorizing- 
based type of teaching" and the opposite extreme where a lack of factual knowledge may 
lead to the "re-invention of the wheel". Whilst much research is concerned with the 
re-examination of "facts", progress would not be possible if every problem had to be 
considered ab and if all, or even a large part, of accumulated knowledge were 

to be Ignored in education and training. There is thus a need for a student to acquire 
a knowledge of some baaic facts, generalizations, methods, processes, patterns and 
structures regardless of whether he is to proceed to research or to operational work 
in an information system. The problem is to select the bits of knowledge required to 
be taught (and therefore identifying the large areas of knowledge in which the bits 
required can be acquired by the atudent at a later stage, for instance in the course 
of hla research or operational work). Since the amount of knowledge which can be 
acquired by a atudent during the period of formal education and training is very small 
Indeed In relation to the total knowledge of a field, to attempt to cram more and more 
facta and techniques Into a programme will only lead to a massive indigestion on the 
part of the atudent and yet make barely an impact on the amount the student Is Ignorant 
of; tha aim should therefore be to enaure that the student learns to recognize the 
situations when he doea not know and to ensure that he does know how to go about ac- 
quiring the required knowledge. 

The second of Bloom 1 a categories la comprehension . This, as stated above, includes 
the ability tc translate. Interpret and extrapolate from facts or ideas. Translating 
includes the expressing of facta or ideas stated in one set of worda in other words 
(both between different languages and in the aame language, as when attempting to 
express ideas in non-technlcal terma), it includes the translating of mathematically 
expressed statements into verbal one:*, and via* verag^ and of graphs into mathematical 
or varbal atat amenta, and Utoa versa . It includes interpreting, for instance of 
instructions for tha uaa of daaalf icatiou or indexing schedules or of data presented 
in numerical table form, and extrapolating from data presented in various forma. Do 
we really make sura that students are capable of doing this? My impreaalon la that 
students who specialised in the physical aclencaa or mathnatlcs all too often aasume 
that onca, say, experimental results are presented in a mathematical or graphical form 
anyone will be capable of comprehending theae results without needing to be told in 
worda; on the other hand humanities background students will write at great length 
even where a graph or a mathematical formula would expreao all they have to aay in a 
cone la e form. It la not only a matter of comprehending facts and ideaa expressed in 
an unfamiliar form but also of an ability for extracting the full content of the 



Cloaaly ralstad to comprehension is application . Hare we expect the atudent to aearch 
for familiar elements in a problem, to use theae to restructure the problem, to aalect 
the theory, principle, idea or method suitable in solving problems of this type, and 
finally to uae thaaa in producing the solution. To take a almple caaas we wish to 
know how to aatlafy a national loan demand for scientific periodicals with the minimum 
number of multiple seta of these periodicals; that la, would it be better to have de- 
centralised regional collections or a centralised national one?* The atudent, by making 
aaaumptlona such as that the loan demand for any given volume of a given periodical 
will follow a Poisson-type distribution, and that tha coat of postage and packing is 
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the same for Che regional or nadonal service! v;..\n Chen produce figures shoving Che 
number of sees of Che periodical required for various loan demands and from Chese 
argue Che case for, or agalnsC, a centralized or a de-centrallzed service. The polnC 
here Is ChaC more Chan jusC comprehension (e.g. of Ceras such as "Poisson distribution") 
Is required. Programmes exlsC which Include a vide range of advanced Copies, one 20- 
hour (in total, noc per week) course, wich 40 hours of 9 practlcals 9 Including, inter 
alia M mechodology of research In InformaCion science and Cechnology, graph Cheory, 
Boolean algebra, information Cheory, operaClonal research, psychology of clssslflcadon, 
semandc Cheery, llngulsdc models, cransformadonal grammars, and a loc more besides. 
One may be a blc sceptical of Che abllicy of sCudenCs Co comprehend all Chls In Che 
Clme devoCed Co 1C, leC alone In applying 1c all. This Is noC Co say Chac such Copies 
should noc be Included In a programme, buC only Co polnC ouC chac mendonlng a vide 
range of Copies Is noc sufflclenC. Time has Co allowed for carefully selecCed Copies 
Co be sCudled In depch, so ChaC Che knowledge gained can be applied. 

An educadonal programme, pardcularly for higher-level students, should ensure that 
analysis Is given sufficient weight. Before solutions can be proposed, problems need 
to be analysed, aitd this applies equally to operaClonal work and Co research. But 
analysis, and by chls I mean Che breakdown of structures (e.g. systems), processes 
or statements into their constituent parts and the critical and logical study of the 
parts and Che whole, should not be considered as an end In Itself, but should be viewed 
as the prelude to further work, not necessarily of synthesis. Statements like "infor- 
mation Is a raw material" or "information Is a commodity" should be analysed, ss 
should such very difficult concepts as "relevance", or "information needs" of users. 
There Is a large number of topics on which useful analysis can be carried out and even 
though no definite conclusions may be reached (e.g. on the meaning of "Information", 
on appropriate measures of "benefit" of an Information system), the analysis of the 
problems will In Itself help to clarify at least the areas of our Ignorance. 

Whilst analysis enables the students to dissect, constructive work requires synthesis . 
It Is synthesis which appeals to the Imaginative mind and It is this Imagination which 
should be fostered In an educational programme. Developing a capability In synthesis 
may be achieved by means of system design studies, by report vTltlng, by planning 
exercises, and other means. It Is the development of this capability which appeara 
to be most neglected In schools and even In the undergraduate courses In universities 
and It Is this which was stressed In various major reports on engineering education^ 
and In my recent discussions with deans and profeasora In aome of the leading schools 
of Information science In the USA and Europe. 

Finally, evaluation . This includes judgement In terms of Internal or external avldence 
Involving the use of logic and estimation of probabilities '(among others). In general, 
scientists, and particularly physical scientists, are used Ko Judging on tha basis of 
data which are on st least the ordinal scale and, perhaps because of this, appear to 
be unhappy when asked to make value judgements as In soma major management decisions. 
It Is possible to teach how to evaluate and judge on the basis of objective (snd 
quantitative) data, but such exercises remain 9 theoretical • If the student bears no 
responsibility for the consequences of his judgements. Tha student who will cheerfully 
decide that an amount one hundred times his annual sslary be spent on a machine for 
the hypothetical Information service, will, when later required personally to be res- 
ponalbla for sanctioning an expenditure one tenth of his salary feel weighed down by 
responsibility. Can 9 games 9 really simulate real-life situations? Nevertheless, 

9 games 9 may be of value In bringing out some of the problems of making judgements and 
thua prepare the student to face reel-life situations forewarned and forearmed. 
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Professional objectives 

Since education and training does not take place in a vacuum, objectives of the pro- 
grammes have to be considered in the wider context* In systems language, the programme 
becomes a sub-system of the larger system of education and training in the country, 
which itself is a sub-system of national policy, and so on. 

In determining the professional objectives of a programme, usually at least four in- 
terests need to be taken into account, namely those of: 

(1) the individual, 

(2) academic, 

(3) professional, and 

(4) national* 

The ind iv idual (i.e* the student) wants the best possible education and training pro- 
gramme for his needs* But what is ’best' and what are his 'needs'? The individual 
is usually singularly ill-prepared for answering these questions and therefore de- 
cisions may need to be taken on his behalf by those with experience in the field* I 
would think that in designing a programme consideration must be given to the career 
expectations of the graduates, which must not be worse than the expectations of gra- 
duates in related fields (e«g* science, engineering), in terms of financial reward, 
prestige, and interest in the subject. We must avoid developing programmes leading 
to dead-end Jobs, however great the demand from 'the field' for such personnel, and, 
on the contrary, we should aim at providing an education for flexibility* Education 
for flexibility will not only increase the employment opportunities for the individual, 
but also by exposing him in the course to completely new topics will broaden his out- 
look and, I would hope, increase his interest in the subject. There is conflict here 
between the shor* -term and long-term benefits of a programme for the individual, be- 
tween the benefits to the individual and to the employer, and between those of the 
employer and the state* So long as the conflict cannot be resolved by an appeal to 
objective evidence, an essentially subjective decision has to be taken, and, in the 
circumstances prevailing in the U*K«, I give considerable weight to what I believe are 
the needs of the individual* However I am not so sure that an equal weight would 
necessarily be appropriate in, say, a developing country or Israel* 

The acad emic objectives are concerned with promoting fundamental theoretical and 
experimental studies* From the point of view of academics, advanced level programmes 
need to provide adequately educated and trained candidates for research work, and 
therefore programmes must include the necessary foundations* 

■ Professional institutions are concerned with maintaining a high level of quality in 
those entering the profession, the level being similar to that demanded by professional 
Institutions in related fields* The usual set of requirements includes a prescribed 
level of general education, of the specialised education, and of practical experience* 
It is usual for the major professional institutions to recognise certain academic 
institutions, or particular programmes, for the purposes of giving full or partial 
exemption from the professional qualifying requirements of the institution and it is 
obvious that a programme which is recognised by a major institution will attract more 
good candidates than one that is not* In planning programmes it is therefore useful 
to note (but not necessarily follow) the requirements of the relevant professional 
bodies* 

Finally, national objectives* The primary objective is, of courae, survival and 
linked with this is the economic, social and cultural development of the country and 
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nation. As already mentioned, there is obviously a source of conflict between the 
objectives of the individual and those of the state, but in an honest (l.e. not 
corrupt) democracy the gap separating the objectives is not likely to be unbridgeable. 

To illustrate: sources of conflict nay be created by the broad versus specialised 
programme requirements, education v ersus training, requirement for operators versus 
research workers, short-term versus long-term needs, and so on. Tn a democracy the 
state can only exert positive pressure, for Instance by providing funds for the sup- 
port of programmes which are considered to be in * the national interest*, it cannot 
prevent a programme from being Implemented {although lack of financial and other sup- 
port may make it difficult to maintain such a programme). Again, the implication for 
the programme designer is that national objectives should be noted but that their 
'official* interpretation should not, necessarily, be accepted. 

Categories of work 

So far 1 have been discussing education and training prograone objectives in general 
terms. 1 now wish to turn to a discussion of the various categories of work. 

As we had already stated earlier^, a useful division can be made into four categories, 
namely: 

(1) production and operation, 

( 2 ) development and design, 

(3) applied research, and 

(4) background research. 

In considering the design of progrannies in relation to the requirements of the 
production and operation category of work, as well as of the other categories, the 
first question we have to ask is "what is the business we are in?". Does information 
work Include, for Instance, all aspects of data processing including automatic equipment? 
It is concerned with "all the phenomena and processes mediated by signs, the carriers 
of information", or is it concerned primarily with the "collection, collation, evalu- 
ation and organized dissemination of scientific and technical information" only? The 
definitions of information science and technology differ widely, but perhaps a des- 
cription of the field is emerging in publications such as the Annual review of infor- 
mation science and technology , periodicals such as Information Storage and 'retrieval . 
Nauchno-tekhnicheskava informataiva . Journal of documentation. Journal of AS1S . and 
abstracts such as Library and information science abstracts, teferatlvnyl ihumal i 
n.t.i . and its English version. The production and operation category will require 
the largest share of the output of education and training prograwes in information 
science and technology, and that at a range of levels. 

I believe that development and desian is a key activity in all but the smallest opera- 
tional units and therefore the requirements of this activity must be taken into account 
in the design of programmes. 

Applied research is required both to back-up the development and design effort and to 
provide the means of bringing the findings of background research to the stage when 
these can be used in design. The applied research worker will need to comprehend the 
background research findings and analyse and synthesise these for the benefit of the 
design workers. 

Finally, the backaround research, whilst fairly remote from the problems of day to day 
operation of information units, is required to provide the scientific basis for the 
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development of new methods end means In Information science and technology. It will 
neceaaarlly be cloaely linked with work In related flelda, such as psychology, lingu- 
istics, communication atudlea, and engineering aclence. 

Priorities In cat eaor lea of work 

Looked at from the point of view of national needs It Is obvious that the first prior- 
ity will be for operational staff. What la not ao obvious Is that even In a develop- 
ing country some ataff will be required for development work since Information systems 
can remain unchanged only at the rlak of becoming increasingly Ineffective, Inefficient, 
and obsolescent. 

I would rate design capabilities aa a cloae second priority . Adapting systems Imported 
from abroad may be uaaful in a high proportion of caaes, but there will always be par- 
ticular local needa for special purpoae ay stems which are best designed by one with an 
Intimate knowledge of the local conditions and requirements. I am referring here to 
the dealgn of systems not only in caaea where security considerations prevail, but 
also of ayatems such aa district or national library and Information syatema. 

Applied research may be conducted on an ad-hoc baela, that Is whenever the need arises 
for providing background Information for deaignera, or on a more systematic baa la in 
accordance with some policy. The piecemeal sppraoch, which Is essentially based on 
trouble-shooting, cannot provide a basis on which education and training programmes 
can be planned, and therefore a national policy ahould be evolved designating certain 
areas aa priority areaa. These priority areas would Include flrat thoae areaa of 
Interest to the country on which little or no research la being carried out elaewhere 
(for Instance problems connected with the language barrier? and language proceaalng), 
secondly problems apaclflc, or of primary interest to the country (problems connected 
with specialised Information centraa), and only thirdly problems of general Intareat 
In Information aclenca and technology • This may, of couraa, not be a popular view 
since the Individual research worker may gain more prestige internationally by working 
In areaa of more general intareat, but the conaaquenca of thla attitude will ba that 
the local naada will be neglected and daalgn and dsvalopmant will suffer. If priority 
araaa are known then centraa concerned with thalr atudy can be developed and conse- 
quently education and training programmes baaed on tha expart laa In thaaa centraa 
can be planned. 

Background raaearch, on tha other hand, appears to ma to be eaaentlally 9 academic 9 , 
and although poaalbla ultimata applications may be Imagined thaaa applications are 
vary remota and tha raaearch la primarily oriented towarda an Increaaa pf knowledge 
of the flald rather than tha potential application of thla knowledge. It la not 
really feasible to plan background raaearch, both bacauaa tha number of powerful ori- 
ginal thlnkara who are abla and willing to devota themaelvaa to background raaaarch 
la vary limited, and because original thought cannot be planned. Tha beat that can 
be dona la to attempt to promote intareat in information aclenca and Ita problems 
and to encourage those willing and abla to work on thaae. 

Even If It la decided to support education and training for catagorlaa othar than 
operation, thare la no aimpla way of determining what proportion of tha total affort 
will be required for thla support. It may therefore by uaaful to nota that In engi- 
neering, much of which originated In craft-baaed induatrlaa and not In raaaarch 
laboratories, and much of Information work practice la still In tha craft ataga, the 
proportion of the graduate manpower engaged In raaearch and development ha a reached 
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about 35-40% (in countries like USA 8 or the UK^) ; it is therefore not unreasonable to 
expect that a proportion of a similar order of magnitude will be required In information 
science and technology by 9 say, 1990. 



We have, so far, asked and outlined answers to questions such as: 

(1) why do we need education and training programmes? 

(2) if we do need such programmes, what are the educational objectives requiring 

to considered in the design? 

(3) and what are the objectives the individual, the academic community, the 

professional community and the state wish the programme to achieve? 

(4) there exist various categories of work: what are the principal ones requiring 

to be considered in the design? 

(5) and what are the priorities in providing for education and training for 



Constraints 

Programmes are implemented not in some ideal environment, but in a real one* We 
therefore need to take into account the constraints imposed on the programe design by 
various factors* In parenthesis, it should be stated that, as far as I am aware, no 
useful studies had been made of the sensitivity of the effectiveness and efficiency of 
programmes to these constraints (e.g. what is the effect of a lack of material facili- 
ties, of good teachers, of different educational systems, etc # ) # 

Let us consider some of these constraints: 

(1) the HIP barrier (l*e* history, incentives, psychology) 

(2) educational system 

(3) students 

(4) teachers 

(5) material facilities, and 

(6) finance* 

The HIP barrier is by far the most important one for unlesa there is the will and de- 
termination for change no action will result* Following Gallowayl®, the history* 
element of the barrier includes not only the history of the individual person concerned 
with the change, but also the history of the institution, of the profession, and of the 
country. All are greatly affected by their past and by the effect of this on their 
attitudes to change, and particularly rates of change* In practical terms one conse- 
quence might well be the need for developing a completely new prograase rather than 
attempting to modify an existing one* 

Change can be brought about, or speeded up, by general exhortations, by the 'carrot' 
and by the 'stick' method* General exhortations and resolutions, so popular with 
some governments and international bodies, are of little value**. The 'stick', that 
is penalties and sanctions, may produce the desired results but at a considerable loss 
of good will and enthusiastic cooperation* We are therefore left with what, in my 
view, is the most desirable method, that of the 'carrot'. Incentives are l survival 
(both national and individual), economic well-being, prestige, and interest in work. 
Some of these incentives have already been dlscussad in relation to objectlvas, and 
it only needs to be added that a wall deslgnad national programs of education and 
training in information science and technology will make a positive contribution in 
sll these areas* 



Summary 



these categories? 
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Psychology Is the third element of the HIP barrier and is the one least written about 
in accounts of programmes. Nevertheless it is a very real factor and cases abound 
where progress had been held up because individuals would not co-operate for reasons 
such as imagined (or even real) 'loss of face', or loss of prestige. Obviously inside 
knowledge of the likely reaction of personalities to be Involved in and affected by 
any proposals considered for implementation in the programme is of great Importance 
since this can be taken into account in the design* It is not only what is proposed 
but also how it is proposed which matters. 

The second major constraint is the educational system which determines what is acceptable 
where , for whom and when 9 and how it is to be implemented. If a programme is to be 
implemented in a university 9 for instance 9 then the design must be such that the pro- 
gramme is acceptable in its level 9 form 9 and other parameters to the institution con- 
cerned. To the extent that educational systems differ between countries (consider just 
the place of engineering education in some of these), to that extent will vary the 
assumptions underlying similar level programmes in these countries. 

The principal constraints relating to students are their intelligence, educational 
background and practical experience, motivation, age, and numbers. Admission require- 
ments to most existing programmes specify some, or all, of these although, sofar, no 
figures appear to have been published to show the sensitivity of programme effectiveness 
and efficiency to variations in these requirements. There exist, however, some figures 
in relation to undergraduate programmes in a universltyl2, which indicate that the re- 
lations between academic progress and factors like intelligence or educational back- 
ground are not quite as simple as might be expected from the rather rigid entrance 
requirements specified for aome programmes. On the other hand students with a rela- 
tively low l.Q. and a poor record of educational progress are much more likely to fall 
than their fellow students. Similarly, atudlea have been carried out (although not in 
the field o£ education and training in information science and technology) of effects 
of motivation, age and numbers (in aeminara, in a univarsity aa a whole), on succeaa 
in prograMea. The reaulta of theae atudlea do, of courae, indicate the importance, 
or otherwise, of theae factors, but for prograaia dealgn and implementation more re- 
search into theae factors would be desirable. 

The shortage of suitable parsons who are abla and willing to taach la, in many caaea, 
a aerloua obstacle to the development of programmes, particularly thoaa for higher 
levela. The dealgn of prograaiea muat therefore take thla factor into account and 
build from the available to the dealrabla on the baala of the attainable at the tlma 
the prograMe la being planned for. To wait until the whole programme can be imple- 
mented by flrat class teachars mean a to postpone the implementation indefinitely end 
thla, in turn, means to destroy the programme, since objectives, conatralnta, and 
criteria are bound to change in a dynamic environment. 

Similar considerations apply to the availability of material facilities . Thaaa faci- 
lities often axlat, or can be built up during the planning ataga provided that tha 
requirements are realistic. In moat countries fairly large computers are available 
and collections of the lltaratura of Information science and technology can be built 
up to complement existing collections of academic and technological lltaratura. Hara 
I would Ilka to atraaa, again, the importance of periodicals ralatlva to textbooks, 
particularly for programmes for tha higher levala. Hera, too, thera la conaldarabla 
advantage in Implementing prograamaa in existing educational institutions (or soma 
larga raaaarch organisations with educational facilities) with their larga libraries 
. and othar facilities. 
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Finally! finance . The provision of financial support is crucial to the planning and 
implementation of programmes. Both the total amount and any restrictions on the ex- 
pendlture need to be taken into account in programme design. This applies not only 
to the provision of funds for the build-up of material facilities and for remuneration 
for the teaching staff , but also the provision of finance for the support of students. 
From the national point of view. If research and development are considered desirable 
then the provision of Information services Is desirable v and if this Is so qualified 
staff must be desirable. This implies that their education and training should be 
considered as of equal Importance to the education and training of research and develop- 
ment workers (scientists, engineers, economists, etc,). 

This brief sketch of some of the constraints shows that even for Identical programme 
objectives different solutions will require to be found In different countries, or 
even In different regions of the same country. This does not of course mean to imply 
that every component course of a programme will need to be designed ab^ initio % but if 
It Is to be taken over ready made a suitable one will need to be selected. 

Criteria 

There appear to be potentially three useful Indexes for appraising education and 
training programmes: 

(1) effectiveness 

(2) efficiency, and 

(3) benefit/cost. 

When asking whether or not a programme is effective , we are asking whether or not the 
programme Is achieving what It was designed to achieve. If we ask, ”how effective?”, 
we are attempting to determine the ratio of actual achievement to planned achievement. 

We have earlier distinguished between (1) educational objectives, and (11) professional 
objectives. Effectiveness Indexes can be determined provided that we can quantify the 
objectives. Considering first the educational objectives It appears likely that a 
series of assessments of the students, by means of examinations, seminar performance, 
course work, laboratory work, and minor research studies, might provide a reasonable 
basis for an educational eff ectlveneas Index, Whilst It Is true that all these asses- 
sments are subject to error, as studies of marking of examination papers have shown, 

It Is still the rule that students' performance Is assessed and (at least In British 
universities and professional examinations) the use of external examiners does tend 
to ensure a reasonable uniformity of standards between universities and between 
professional Institutions, 

To quantify professional object Ives Is considerably more difficult. Is the programme 
to be considered 100X effective If all Its graduates find posts In the area of Infor- 
mation science and technology for which the programs^ was to prepare them, or are 
only those graduates to be considered who 'survived' a given 'probationary' period 
In employment (excluding those leaving for reasons of health or for family reasons)? 

Are only those who achieved a predetermined level of seniority to be counted? Is 
salary relevant? And publications? 

When asking, ”how efficient Is the programme?”, we are attempting to find an output 
to Input ratio. We may, for Instance, use the ratio of graduates to students at the 
start of the programme, or the pass rate, or some similar ratio relating to students. 

We may, on the other hand, have a mixed ratio like graduates to cost of programme. 
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where the cost needs to be defined closely • To amplify: cost may Include all the 
Inputs, such as manpower (teachers, administrative staff, and the marginal cost of 
other persons, plus the cost of students, Including their loss of earnings), materials 
and machines, and overheads Including depredation (an Item which may be particularly 
heavy If one Includes depreciation of software and Investment In programme preparation). 
The graduate to cost of programme ratio has also been used In cost-effectiveness 
analyses 13 either to find the cheapest means of accomplishing a defined objective or, 
finding the maximum output for a given expenditure. 

The benefit to cost ratio requires, again, a definition of the terms used: 'benefit' 
and 'coat'. Whilst 'cost' Is often defined In the sense of Input costs, as Indicated 
In the preceding paragraph on efficiency, • 'benefit' Is still subject to much discussion. 
The Importance of this ratio Is that given a limited amount of money available for 
expenditure on all kinds of necessities (or even luxuries), a decision has to be made 
on how to spend this money to best effect. Would the same Investment In promoting 
(say) other programmes bring greater benefits? And what would be the losses If the 
programme were not Implemented? Benefits and costs, to whom? 

A common approach Is to use a 'rate of return' on Investment, that Is the return to 
programs graduates In terms of the total lifetime Increase In earning resulting from 
the programme as compared with the cost of the programme. The financial bensflts are 
not the only ones, but other than financial benefits are difficult. If not Impossible, 
to quantify. There are, of course, considerable difficulties In calculating the life- 
time Increase In earnings, firstly because programmes In Information science and 
technology are of too recent an origin and therefore none of the programme graduates 
has worked for 30 or 40 years following graduation from the programme, and secondly 
It la difficult to estimate what the Individual might have earned had he not parti- 
cipated In a programme, that Is, estimating the degree to which students opting for pro- 
grammes In Information science and technology are representative of a larger body of stu- 
dents attending prograaies, In related fields (e.g. science, technology, or economics). 

The benefit to cost ratio Is probably best ussd not In comparisons of Investments for 
different purposes, but In comparing programmes aiming to achieve the same, or similar, 
objectives. 



In this paper we considered three primary components In system design of programmes 
of education and training in Information science and technology. Wa have, Intentionally, 
left out consideration of designing particular programme and therefore wa have not 
discussed methods of forecasting needs and levels of work (which are likely to be de- 
termined by analogy with related fields). 

Following a brlsf note on the need for education and training programmes, we considered 
both educational and professional objectives (and briefly discussed catsgorlss of work 
and the priorities), a variety of major constraints, and, finally, some of ths criteria 
for assessing education and training prograraes. 

We hope also to have Indicated some of the areas In which mors rssaarch Is required If, 
In future, programmes are to be designed on a more scientific basis than In ths past. 



Conclusion 
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1HE MEED OF HIGHLY QUALIFIED LECTUMERS FOR THE 
TRAINING OF INFORMATION OFFICERS 

, G. Ralchardt 

Head of the Control Librory of the Nucloor 
Roooorch Con tor Jilllch 



SYNOPSIS 

Booldoo oil now programmes for tho training of information 
officers, woll experienced locturoro oro necesssry for ooch 
training* Tho lecturer must bo oblo to inopiro hio hoororo 
oo for. and oo much that they toko enough with thaw, ready to 
attack now and better idoao and to become active during tho 
whole of their profoooionol life. Consequently post-graduate 
seminars for lecturers should bo regularly given in ell coun- 
tries of the world. An educational system for lecturers in 
library and documentation services should be organised at once* 

When you invited me to lecture I was quite sure that I wouldn't prefer e common-piece 
subject such as education in Germany or in the town of Jiillch or in our Nuclssr Re- 
search Center* I didn't wish to psy homage to local patriotism that didn't seam to 
be reasonable* I have had experience in the training of documentallsts at Frankfurt 
and in the training of scientific librarians at Cologne* I believe that many an edu- 
cation programme of today is often non-effective, obsolete, filled with ballast* The 
basic concept that documentation end information sciences ere applied sciences end 
social science, is missing. Therefore, there is no education for the performance of 
activities beyond the reach of libraries as such* 

That wider ranch means i teaching methods, how to advise, to provide, and to satisfy 
the individual user? That wider reach means i lessons about the various means of dis- 
covering unknown literature, often representing secret literary paths for the asto- 
nished user; lessons about methods, how to encourage the resder to read the same 
book as oneself; lessons about the assistance e documentellst should end could give 
to oach visitor wanting to learn and to ell those entering e library - the documen ta- 
ils t has to be trained in interhuman relations* 

Therefore, we do not often asks where do lecturers come from? Where ere well experien- 
ced teachers? 11 * The problem, how to manage information is rarely recognised) but 
education methods ere not discussed at ell* 

Today's lessons in llbrerlanshlp and documentation ere characterised by the features of 
e sort of pre-school trainings educational literature about training in llbrerlanshlp 
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or In documentation activities presented in form of books, lectures, programmes, 
conference papers, or in other endless masses of papers, and that hardly any more 
has meaning than business forms or the bureaucracy of documentation work: library 
and documentation work are represented in the form of a one-sided conversation, 5 
The lecturers seem to look at the students like married spectators Hatching a wad- 
ding and saying: "Let them too see how life is!" 

Should education be regarded as successful when the level of knowledge has raached 
a certain mark of success in the same way as the calculable successes in the produc- 
tion of refrigerators, motor-cars, and other Industrial products? By no means! 

For a good lecturer must not only take into consideration the human and social re- 
lations of libraries and documentation centers during his lacturaa, but also must 

point them out* Without establishing these relations for the audience, for students, 

for users, we could abandon all our documentation aervlce, catalogues, book collsct- 
lons, and bibliographies - and last but not least - ourselves too* 1 found thle 
fundamental attitude beat and moat clearly applied by the University of Pittsburgh, 
Pennsylvania, in its educational prograamm for Information Science, that attrlbutee 
great importance to the "Foundations of behavioral theory. "3 

Where are the difficulties coming from which our modal structure has put up against 
a real comm 1 cation of information? Actually, these are difficulties which ths 
Enllghtment has had to overcome In all countries and in various manifestations sines 
200 years* You find the idea expressed in a popular German proverb: Bsttsr ths 
devil you know, than the devil you don’t know! This provsrb as applied to communi- 
cation of information means: It is not absurd: The person truly responsible for the 
efficiency of information la not ths one mho informs but that ona who la Informed**. 
According to that discernment* ths Information process la datarmlnad as an event, where 
the receiver, the atudant, the auditor ia ths important partner* To request an im- 
provement of educational methode mesne understanding that documentation work and 
librarians 9 knowledge do not unfold theme elves like a magic flower, but it means, 
that in the long-known educational and didactic qualities, r aqua a ted by Feetalosal, 
Comenlua, Kerachenstainar or Bilan Kay, Iduard Sprangor and Anton Semjocovich 
Makarenko, should also be required for studies of llbrarlsmahip and documentation: 
vis* enthusiasm for ths subject, patience for the audience, continuity, prosecution 
of the fixed aim, dynamic formation of the lesaoma by adding Intellectual stimulus ’ 

I without which atudeoto respond poorly to lecturers* "Information obtains soak*' but 

encouragement obtains all*" 1 Ths teaching of the subject only represents sc licks 
i without the students 9 own participation* In that way we train only enginasre sf 

[ catchwords* Soon after the examination, all subjects taught fall off ths walls 

} Ilka inferior house-plaetar* 

Ia general: Docunontallata are trained with am orientation teverde forthcoming esse 
i mlnatlona, but actual ly documam tel late oriented towards practice ought to be trained* 

i for a good lecturprrwct he ohlo^te dia gn ose, to prognose, < is* judge the situation in I 

ordar to lecture and to reeSMOodtbe right therapy, the right /decisions* the right 
j means* The only reel method of modern education is exclusively the right amolpciB • 

of aim and mesas i that means, mm analysis of the professional' situation at librefias 
and documentation centers and am analysis of 11 tore ter a courses* This education, . 
called "training am the job" shall prepare tho etudeats to take over a special ps*l* 

I tion and to fill a special fumeting* Thtsrgaaiysle wham carried put as deliberation ; 

and decision should result in a syetms of relation between taachar and student, tbe * 

t * • - - 

s- 
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method of which can be ehown at a pragmatic model implying good and critical relations 
between teacher and atudent and including a good feedback to the teacher, so that the 
ideae of library and documentation considered to be fundamental and obligatory may be 
submitted to modification and correction. 

During library and documentation lessons we are learning to answer the questions: 

How? How to proceed? How to analyze and how to catalogue? How to find literature 
for a special problem? But where are we taught to ask: Why shall we act in a cer- 
tain way? What doea thia card index actually accomplish? Which rules and regula- 
tions ara applied to the administration of the branch library, i.e., of the branches 
of inatltutaa and laboratories? What is the matter with the basic strategies of 
cataloguing? 13 Why la that place of employment saved? Why is this colleague preferred 
and that one dlamlaaed? Where can we learn to be more than a well turning small wheel 
within the machinery of an institution of industry or research? Where can we acquire 
tha knowledge required to aak 9 to think, to judge in a self-reliant way? Where are 
we taught to recognize the margin of action of a documental is t and to assume respon- 
sibility for ourselvea and othara? Where can we learn that it is indispensable to 
know for each information the relation between the various branches of knowledge in- 
volved , whose theories and methods have become extremely complicated? Where can we 
learn that language of aclanca has become an artificially constructed language based 
on formalism and symbolism, where a development of pragmatic-semantic aspects results 
in purely syntsctlcsl questioning? We must learn to reach decisions, according to a 
wall prepared planning, to prognosticate, to prepare decisions according to need .12 

We should encoursge the inclusion of studies of the operation, economics and use of 
existing services and of the need for further development. The training programs 
should consider the management problems of libraries and Information systems and 
should identify the object ivas of future planning and operation and the types of 
information that era required. The students should learn something about technolo- 
gical forecasting, about PPBS, about analysis of needs, about the mechanics of deci- 
sion making, about RID programing and goal determination. The educational refor- 
mation must start at this point. The practice of a lecturer will have to assume 
information about the students' interest in life and about their expectations con- 
cerning education as well as a critical opinion and a clear understanding of the 
present problems. How to educate such documentallats? Where are appropriate 
lecturers to recognise and to solve those problems and to assume conscious respon- 
sibility? Consequently there is a correlation between the request for experienced 
taachara and the efficiency of education and there exists the same difficult ques- 
tion of the efficiency of lectures. For the lecturer must be able to inspire his 
students? so far and so much that they will become good information officers^: that 
he will be able to establish communication from brain to brain. 

The subject of the lectures should be based on the need for things which should be 
dons in library and documentation world, and accepted as their mission by the stu- 
dents themselves. This knowledge may consequently be derived from two main sources: 

Firstly: Hors attention should be paid to the audience, to the students and to the 
uaars of information! That maana mors studies, more research and more lecturing 
about the environment of library and documentation center .4 

Just as industrial production can't be stimulated without marketing, library and 
documantatlon center, too | and education in librarlenahlp and documentation ahould 
take into consideration, mors than until now, the environment, the audience and the 
user that means l.e< * students and their future duties. 
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Secondly: 1 don't know whether our time Is better than the past or the future r but 
In spite of all controversies In this regard It Is marked by one evident advantage: 

It admits criticism* 

And concerning the methods - we should seek Increasingly to Identify those methods 
of study that can best provide the highest type of training and that can stimulate 
the application of proven techniques , data, and knowledge* Knowledge Itself occupies 
the second position* At this moment, when for the first time the student feels when 
listening: "That Is of great Interest to met - at this very moment the long, enticing 
way of a do cumen tails t starts, rich In advantages, In experiences, In disappointments, 
which nobody knows but you yourselves* That means In consequence, not to strive me- 
thodically for the moment In regard to which Faust longing says: "linger a whfle, 
you are so beautiful" • As to the method we can therefore proceed In 3 different ways: 

1) with pathos, viz* complaint, praise, heroic songs of praise, gliding, accentuation 
of culminating points In development, underlining of evident pluses or minuses are 
the most Important elements* 

2) with a Irony viz* the lecturer reflects Ironically on the deficiency as an effect 
arisen In the subject* Tension between longing and fulfillment Is reflected; a de- 
ficiency Is represented by Irony as difficult to overcome and hardly avoidable* Or, 
a deficiency may be represented as cultivated laziness or mistakes may be unmasked 
In a wrongly programmed mental range or unpleasantness of life occurring everywhere 
In library and documentation service, and which 1 meet too - may be presented* 

3) with a sense of humour, viz* the powerful palliative against a deficiency not yet 
overcome, against Incomplete Intelligence and lack of knowledge* 

Therefore, It seems Important to me to take into consideration - for any education 
needs lecturers - that an educational system for lecturers In library and documenta- 
tion services should be organised at once* For a high standard of knowledge doesn 9 t 
qualify a lecturer or a teacher: Host of the library and documentation schools find 
their lecturers In the same way, as one finds a $100*- note, by accident, unsystema- 
tically and most of them do not find even one* They mostly find cents, that means 
"In general: the teacher Is not prepared for the library and documentation centre of 
tomorrow, but for the one of yesterday*" 10 

The position of a professor In the university’s scope or of a director of a museum, 
or of library or documentation canters does not guarantee the ability to teach or to 
act as lecturer* For not only the subject matter should be taught, but It should be 
shown also how to develop the subject and how to acquire a personal relation to It* 
Even the lecturers 9 work should be submlttsd to control, censorship, and sxamlnatlon* 
Qualification of the educators, lectursrs, and places of education must be fixed* 

The authorization to educate should be limited to three years* A control of educa- 
tion courses and the determination of educational aspects should be Introduced* A 
most specific educational strategy against inflexibility and Inefficiency must be 
developed* 0 We must try to Improve a great number of relations between the educa- 
tional system and the educational environment* You sse that the main problem Is: 
how to arrange for an actlva educational system* Modernization of education also 
depends on providing the educational system with means for Innovative processes and 
on creating an adequate modern administration* In fact: all deulrabis measures of 
performance are reducible to the user criteria of reliability .md efficiency - and 
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the management criteria of coat. Otherwise, It only represents a pretended school, 
a pretended education. The critical self-control to which studenta should be raised, 
should be aet aa an example and demonstrated by the lecturers who ought to have this 
ability, 2 A better control of the results In education should be Introduced. 

But an education for lecturers which Is oriented towards application and practical 
work does not exist •• 

The greater Is the uneaalneaa In special libraries or documentation centers, which 
employ scientists who are not at all or poorly educated, that means, uninitiated 
employees, who have Ideaa but no complete educational background. Consequently 
post-graduate seminars for lecturers - not only for Information officers - should 
be regularly given In all countries of the world In order to lead out of Ivory 
towers and to bring every routine training programme to life. 

I am closing my lecture with the following old story, which will demonstrate, that 
this fear Is very old and that hope and good wlahes are justified. 

"When I was on my travels, I met a huge haycock which had collapaed and lay across 
the road. The peasant, standing bealde, called to me to help him to set up the 
cart. That touched my haart. We had two planks to hand. We forced It under the 
cart and levered tp the bast of our ability, the vehicle tottered, rose up and 
stood and we loaded tha hay again. And the peasant aald: "Nobody knowa whether he 
can do something unless ha has tried it and because you got In my way. The cart 
collapsed. In ordar that you say help me. What else have you learned If you 
haven't learned that everyone has hla own way of duty? What: would be a God who only 
would have one single way of service? Ha doesn't exist. There la a way to serve 
God by praying and there la a way to serve Him by good deeds for other people and 
tharu la another way to sarvs God by learning. 

This etory la not my own story, I found it In the book "Gog and Magog" written by 
Martin Buber. 

It aynbollsee our eubject, for the theory of library and documentation service la 
actually not different from the assistance to the great number of "hay carta" that 
have collapsed. Even lacturera of Information science should learn to are thoae 
"hay carta" and to draw the listeners' attention to them. We all ahould help, 
setting up the "hay carta" collapalng every day. We all ahould wish that It ware 
only collapaed hay carta needing our help. 



REFERENCES 

1. GOETHE : 

2. GOETHE : 



3. RENT, A.: 



Artamla-Cedenkauagabe, vol. 8, 1961. p. 104 

Artemla-Gsdenkausgsbe, vol. 8, 1961. p. 607. Goethe'a 
atatement on tha author, la alao valid for tha lecturer: 
"For tha author raraly la unprejudiced judge of hla own 
works." 

Information Sciences-Devslopment of an Interdisciplinary 
graduate education program at the University of Pittsburgh 
(American Documentation Inatltute. 26th Annual Meeting, 
Chicago, 111., Oct. 1963. p. 111-114). 



533 



..175 



VI Selection , Education and Training of Personnel 

4. GOETHE : Artemls-Gedenkausgabe, vol. 14, 1964* p. 915* 

5. GOETHE s Artemls-Gedenkausgabe, vol. 22, 1964. p. 651. 

6. NATHALIE SARRAUTE : Cited by Ludvig Marcuse: Argumente und Rezepte. MUnchen 

1967. p. 60. 

7. VINOKUR, M.M., et al.: Request: "considerable encouragement of librarian activity. 

for the lap r oven ent of the system of recommending biblio- 
graphy of technology. Complex planning. Investigation of 
theory and methodology, renovation of classification, cri- 
tique of literature and suamury and expansion of the best 
experiences.” (Vinokur, M.M. and others: Technical biblio- 
graphy and bibliographical Information activities of the 
technical libraries. Sovetskaja blbllograf Ija, No. 3, 
1967. p. 91-131.) 



8. COOMBS, P.H.: 

9. HOLM, B.E.: 

10. COOMBS, P.H.: 

11. GRUNWALD, W.J 



12. MARTYN, J., and 
VICKERY, B.C. : 

13. TAGLIACOZZO, R., 
ROSENBERG, L. , 
KOCHEN, M.5 



World Educational Crisis. A System Analysis. Oxford 1968. 

How to manage your Information. New York. Relnhold Book 
Corp. 1968. p. 292. 

Die Weltblldungskrise. Stuttgart 1969. p. 187 and the 
chapter: Teacher - a problem of quality and cost. p. 50-61. 

Asks "For the proposed system of education we are In lack 
of teachers. This objection reveals a serious difficulty. 
We are Indeed in lack of teachers, especially of teachers 
for information science". (Der Blbllothekar und seine 
Ausbildung. Zeltschrlft fur Blbllothekavesen und Biblio- 
graphic 16, 1969. p. 168). 

The complexity of the modelling of Information system. 
(Journal of Documentation 26, 1970. p. 204-220). 

Access and recognition. From users' data to catalogue 
entries. (Journal of Documentation 26, 1970. p. 230-249). 



534 



GRADUATE EDUCATION NEEDS OF INFORMATION SPECIALISTS 

Dorothy B. Lilley 
Professor and Head 
Department of Library Science 
East Texas State University 
Commerce, Texas, U.S.A. 



SYNOPSIS 

This study identifies the graduate education needed by the practicing 
information specialist: (l) subject reference and subject bibliographic 

control, including control of current information; (2) research in 
information problems; (3) information center administration, including 
cost analysis; and, (4) design, evaluation and use of information systems. 

Introduction 

Administrators in ten selected scientific information centers in the United States 
were interviewed in 1964 ( 5) • The centers, representing eight different scientific 
and technical fields, were selected for their use of sophisticated, non conventional 
information systems. Data were collected by means of a structured interview 
schedule. These data were supplemented by those from interviews with officers of 
professional organizations in scientific, technical and special library and 
information science fields, and by reading and consulting during the next five years 
while analyzing and interpreting the extensive data. 

The purpose of the study was to find out (l) to what extent information centers were 
different from libraries and different from each other; (2) what activities 
information specialists performed or in which they were most frequently involved, 
for it was, in fact, found that several different types of professional personnel 
were involved in most of the information center activities; (3) what knowledge 
information specialists needed; and, (4) what part of the knowledge that information 
specialists needed required formal education. 

Apparently by selecting outstanding centers in terms of their sophisticated 
information systems, and studying these in detail, trends were identified that have 
prevailed in the educational needs of practicing information specialists. Factors 
which affect the kind and extent of the formal education that information 
specialists need are: the environment in which the information specialist works, the 
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activities in vhich he is involved and the areas of Knowledge that he can learn on 
the Job. 

Environment 

The environment in vhich the information specialist vorkB is different from the 
university library environment. Three types of information centers that are 
different from libraries and different from each other were identified. The 
differences between the information centers and the libraries were attributable 
mainly to differences in input, i.e., personnel, materials and equipment use, and 
did affect a difference between the education needed by librarians and the education 
needed by information specialists. The differences in input between the libraries 
and the information centers were (l) several different types of professional per- 
sonnel were employed by information centers: computer specialists, scientific and 

engineering personnel, information specialists, librarians and adslnlstrators, while 
in the libraries there were only llbrarlsns and administrators; (2) several types 
of documents different from those in libraries or no documents at all were input by 
the centers; and, (3) Input of computer and other equipment was widespread in the 
information centers, but not, at that time, in libraries. The differences miong the 
information centers were largely differences in distance from and relationship to 
the users, some being totally remote, others working with their users in a common 
endeavor and others acting as liaison officers from sources of information to the 
user, who in these cases, were research scientists. These differences did not 
appreciably affect the activities of the information specialist in the different 
types of centers, not the Kind of education needed. 

Although this study did not expressly investigate the dlfferencee between the output, 
i.e., services, of the libraries and information centers, it is known that there are 
dlfferencee in form, e.g., print -out from microform; method of serving the user, 
e.g. , selling or giving the output rather then lending it; and, differences in 
relationship to the user, e.g., searching out and delivering information to the user 
rather then requiring that he select and retrieve it himself. Some of these 
differences are more a matter of degree then of types of services. One of the 
principal differences between libraries end several of the centers Is that some 
centers operate for profit or to break-even. This le a trend that has been develop- 
ed to a fine point in one center investigated in 1970 by a student at last 

Taxas State University (7). Using the earns interview schedule which was used in the 
original study, he found that all of the services of the center are on a charge 
basis with a subscription daposlt against which all services are charged. 

Activities end Areas of Knowltdgs 

The activities in which the infoxmstlon specialist is involved end the areas of 
knowledge which be needs to know are closely allied. Seventy-one activities and 
thirty-three related areas of knowlsdge were surveyed. In the majority of the 
centers, information specialists were found to be involved in fourteen of the 
activities and to need formal education in thirteen of the thirty-three related 
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areas of knowledge. The Information specialists were considered to he relatively 
more competent in the activities and areas of knowledge related to design and 
operation of their own information systems : coding index terms , developing 

terminology authority, information system design, transforming user questions into 
the terminology of the system, evaluating materials retrieved, coordinating elements 
of the information system, retrieval by logical operations and selective dlssemlna- 
' tlon. They were less competent in activities and areas and of knowledge related to 
subject bibliography and reference, research in information problems and Information 
center administration, including cost analysis. The former are the areas that have 
tended to define Information science. Several engineering and technical schools 
have developed educational progrvw including these areas. 

Graduate Education Heeded 

lfce principal contribution of this study is that it revealed (l) that the function 
of the information specialist is largely locating and transmitting subject infor- 
mation to the user; (2) that knowledge of subject bibliography and reference and 
bibliographic and subject matter control is the greatest felt need for the education 
of the practicing information specialist; and, (3) that therefore, the progr« of 
education in information sclencs which does not Include the study of guides to and 
•ourcss of the literature of a subject. Including the available subject data banka 
stored on magnetic tape or disk, cannot be considered sdequate education for the 
practicing, professional information specialist. This fact contributes to estab- 
lishing ths library school as the location for education for the information 
specialist. These findings have been obliquely confirmed by Klempner (k) who 
reported in 1966 that the information from several centers was isolated and poorly 
disseminated, implying that information specialists would benefit from knowing of 
other information besides that produced by their own centers, end by Taylor (3) in 
1967, who, although not affillatsd with a library school, recOMended that library 
schools change their curricula to parallel the growth of the information sciences 
end that graduate study in the information sciences be imbedded in research. While 
library schools have traditionally accepted the responsibility for education in 
subject bibliography and bibliographic control, they have little tradition in 
research end evaluation testing the status quo and still less, until recently, in 
design of nonconvent lonal information i/atmi. The problmi has been, ee Muller (l) 
noted, that library education has been taught as "Bible-truth" while it should be 
taught as "an area of knowledge end theory that la subject to continuing reappraisal 
end renewal." The design of new information systsms was a major break frcm the 
tradition. It baa incited a new type of service end a new type of operation. This 
study found that afelnistratlon of such ssrvices through information centers pieced 
major enphaals on cost analysis. This is understandable since there are a nusber of 
centers which operate for profit. At th* time of this study there were very few . 
courses being offered in the afeinlstratlon of information centers. Ths content of 
such courses is a matter of current concern (6). 
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Conclusions 

The principal areas of knowledge that require foraol education are those related to 
subject reference and bibliography; research in information problems; information 
center administration* including cojt analysis; and* information system design* 
evaluation and use. 

Obis study, although based on data from scientific and technical Information centers* 
predicted widespread application to other fields. Wider application is Indeed the 
current prognosis, not only in information centers* but in libraries (2)(6). 
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SELECTION, EDUCATION AMD TRAINING OP PERSONNEL 
Chairman: Prof. J. Gross ( Israel) 



MOTV H. BORXO (US): So ms of you mm y bt familiar with t curriculum that wtt published 
la "Computer Sdtnctt 99 bp tha Aaaoclatlon of Computing Mach Inary. Thla had a grant 
Impact on developing a aaparata diaclpllna of computer aclanca In tha US and through- 
out tha world. Tha Aaaoclatloa la now working on anothar curriculum for a dagraa 
called "Information Analpala and gpatema Daalgn". Thla will ba published aftar It has 
game through all tha nacaaaarp reviews, which will ba In alx non tha to a year. I 
think It will alao make a great Impact and there la a need for dlecueslon of It from 
our point of view to that the now curriculum, whatever It will ba, will not ba fine- 
Head before wa have got our point of view across. 

HRSm L. VIl£MTCHVX (Israsl) : I want to touch on tha shortcomings of moat training 

programs In Information aclanca cad the reasons for these shortcomings as I see them. 
Tha training of information workers in moat countries, Including Israel, la like an 
upsid e down pyramid with emphasis on theoretical subjects. What, In effect, are wa 
dolmgl Va era trying to produce research workers and theoreticians lnetead of pre- 
paring a cadre of useful and highly qualified technicians. When 1 eey technic lane, 

I don't neceeeerlly moan non-graduate workers. In my terminology, technicians are 
ell thoee who practice their profession as opposed to thoee engaged In reeearch and 
development of new techniques within their professions. They' nay be non-graduatee, 
or os a higher level, graduates with Bachelor's or Maatar'a degrees. 

University schools In Information aclanca era sprouting ell over the world. The 
curricula of moat of theee schools either Impart knowledge In library techniques only, 
er, going to the other eatrama, amphaelac purely theoretical subjecte, euch ae mathe- 
matical theory of communication, operations research, ate. Evan woree, In my opinion, 
they enphaelie tha etudy of computer technology as a subject In information aclanca, 
disregarding the fact that am Information scientist should ba the user of library and 
computer services and not their operator. 

Tha etudanta are fed a lot of inapplicable knowledge, rather than being given the 
tools for tha intelligent practice of the profaealon whose object le to eerve ae tha 
liaison between the generator end the ueer of information. 

Graduates are often unhappy and frustrated because their work after graduation 
does not measure up to the high expectations evoked by the printed curricula of tha 
echoole. Moreover, the echoole mislead then into thinking that tha attaining of thle 
or that degree gueranteee proficiency on the Job, whereas proficiency on the Job re- 
quires long years of experience. The result la that the intellectually honest, the 
gifted, and those poeaeeelng Initiative and personality are topted to leave tha 
profaealon. 

The other weakneea la the training of tha profaealon la tha criteria by which ap- 
plicants for training are selected. Information science la a eervlce profession, 
therefore, the personal qualifications of its members le of paramount Importance. 
Unfortunately, no reeearch hae been dona yet on what personal qualifications are needed 
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In ■ coapetent information aclentlet. Beceueo of tha nawnaaa of tha profeeelon than 
la probably not enough data on which to baaa ouch raaaarch. However, coaaon aanaa 
and tha experience of nanagara heading Information unlta may provlda a oat uaaful 
guidalinoa. 

Ha need, primarily, all-around, goed ganaral practltlonara, to borrow a tarn from 
aedldna; paopla with a broad ranga of lntareata, a eound ganaral education and epe- 
dallsad education In tha particular dladpllna In which they will work, People with 
intuition and tha ability to feal, think mid llwa othar people'a faellnga, theughta 
and naada; a cowand of aevaral languagaa, a balanced, not aggraaalvo pereoaallty; 
tha ability to ramambar facta thoy coma acr>aa even through chance reading or paroonal 
encountaro and aaaodata than at tha right nonant with tha problem at hand; paopla who 
pay neticuloua attention to dataila, but wh*> navarthalaaa diacorn tha aaaaaca of a 
problem and, finally, paopla who are articulate In oral and written preeentaf lea. 

Vary ram qwalitlee Indeed. 

The boat aga la Juat ovar 30; at that ago atudenta ora atill young, but mat youth- 
ful; tha wary young ara generally aggraaalva, ogoiatic, aolf-comtarad, unable to wn- 
darataad othar paopla 'a paychology, naada and vlawpointa - all quail tlaa thick ara 
vary good la none profaaalooa (a.g. tha nilitary), but mat aultabla far any aarvlca 
profaaaloa. 

Oaa loot point. Thera ora profaaalooa which require from the eutaat a grant 
Mount of theoretical know lodge, ouch aa, for laatanca, chaniatryt tha education of 
a chanlat today la not coaploto if ha haa not acquired a Ph.D. dagroa. On tha other 
hand, than ara profaaalooa ilka engineering or Mr own prefoaaloo and many a thorn, 
which require yaara of practical aapariaaca before additional theoretical haMledga 
la dlgoatabla, uaaful and deeirabla. 

I therefore believe that achoola with a Ph.D. program la iafamntioa eclaaco are 
pronatura. Tha elite of our profaaaion, after yaara of aapariaaca (in tha Jab, will 
develop new technique a, do raaaarch and open maw viotaa far tha prafaaaian with or 
wlthMt a Ph.D. degree. Thaaa actlvitioa ara not to be engaged la Jno tha ochoel 
bench. 

map. M. gifgd (India)} India la a Padaratloa of eight aaa atatee where aducatiaa 
■ la a etata oubjact. Library education la tha raapaaalbility at the individual atataa, 
but the Central Ma idtry of Iducatloa layo down tha national policy far tha aatira 
country. The Dalveralty Craata Canal aaloo aata the norma Ilka tha naivaraity Ornate 
Coualttee la tha OK. 

Than era 34 naivaraity library achoola la 14 atataa. la addition, wa have two 
internet 1m eciaaca acbaola which offer apeclallead tralalag la docunaatatiaa ad 
lafaraatiaa aciaacaa. 

Tha equivalence at tha dagraaa la library eclaaco la varieua couatrloa la a problon 
and thia la tha right forum to diacuaa it. A Heator'o dagraa program la np> country la 
a two-year, fnll-tina prof aaa lanal ceurae given after tha canplatiaa of tha flrot do- 
gma, while la naay cawatriaa a Maatar'a dagraa progran la only aaa year after gradu- 
ation. Dagraaa awarded by the lrltleh Library Aaaaciatiaa and by tha Loadaa tchoal of 
Librariaqahlp and othera ara not to c ognltad aa an oquivalaat ta a Maatar'a dagraa la 
tha g|. Inch lacoagatlbilltlaa craata prablaaa far atudenta wha would llha to go to 
atber cawatriaa far higher aducatiaa, Itandardleatleu af dagraaa la eecaaeary. 

Tha fallowing aatract coacaraa tha aquivalaaca of profaaaioaal dagraaa la Canada, 
tha Uf and tha OR. 1 quota an attract from, "Baffling variety t educational nothada 
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for llbrarlanahlp In Groat Britain, Canada and the United Statea”, A. Roberta Rogera, 
Coll, and Rea. Llba., 23{1)» Jan. 1962 t p. 45-50. 

"In tha eummar of 1959 an informal committee conalatlng of J.C. Harrlaon, Head 
of tha School of Llbrarlanahlp', Mancheatar Collage of Technology; George Pitcher, Li- 
brarian of tha Rumaal College of Technology, Ghana; Bartha Baaaam, Director of the 
Unlveralty of Toronto Library School; Letter As he la, dean of the Unlveralty of Chicago 
Graduate Library School; and Robert L. Gltlar, executive aacretery of tha Library Edu- 
cation Dlvlalon of ALA, mat In Urbana undar tha chalrmanahlp of Dr. Lancour. 

H The coMlttaa ra cotn dad the following tabla of mlnlaum raqulremanta: 



Canada 



a) Bachelor 1 e dagraa from an approved lnatltutlon. 

b) i.L.S. dagraa from a library achool accredited by the ALA 
and CLA. 



U.S. 



U.K. 



a) Bachelor' a degree from an approved lnatltutlon. 

b) M.S., M.A., H.L.S. (or almllar dagree) from a library achool 
accredited by tha ALA. 

a) Bachelor' a degree from an approved lnatltutlon. 

b) One year of atody In ona of tha ten library achoola approved 
by tha Library Aeaoclatlon. 

c> Poaaaaalon of tha Aeaoclataahlp of the Biitlah Library 
Aeaoclatlon. 



"Theae propoaala mark a rani atap forward In tha matter of International library 
cooperation, although they do la/.va a few quaatlona unanavered. For axaapla, no 
mention la made of the recognition to be accorded tha holder of a Brltlah unlveralty 
dagree pine tha FLA or the Academic Poet Graduate Diploma In Llbrarlanehlp of the 
Unlvaralty of London. By lapllcatlon at laaat, the a a qualification are ranked 
aomewhara above the minimum, perhapa at the level of the Canadian M.L,S. or the old- 
atyle alxth year American maatar 9 a degree". 

Work haa already .been dona on tha formation of an International Aeaoclatlon of 
Library Schoola, and we have been In correapondanca with IFLA. Ua have about 35 
library achoola who are lntaraated In forming an International Aeaoclatlon of Library 
Schoola to dlacuaa library education and training program.. Va ahould taka aome da- 
cialon on thla laeua. 



Tha next point to be coealdarad la the exchange of taachara In library aclenca 
In varloua countrlee, provlalon of fallowahlpa for foreign atudanta In aalactad li- 
brary achoola, holding profaaalonal International aamlnera on library education, 
analyala of varloua couraaa In library achoola, etc. I would Ilka to Invite your 
euggaatlona on theoa laauea which are praaalns problama for library education In 
every country. 



PROF* V* SSSSIORS (US) : The profaaalonal development program at tha City Unlvaralty 

la intended for people who are practlalng llbrariana and for Information apaclallata 
who want to learn aone of tha newaat tachnlquaa. However, tha program waa daalgned 
with a broader goal In mind. Va were trying to look at what llbrariana and Informa- 
tion a aclentlata In general hava to know, not juat what prectltlonara have to know. 

Va may, aa we go on to tha doctorate, have an Intermediary atap. It la poaalbla 
that wa will decide to have a broader dladpllna called Information Science, which 
will ba different from Computer Science. 1 did hava aavaral dlacuaalona about tha 
eventual Ph.D, degree in Computer Science with tha Head of tha new Computer Science 
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Program at the City College. It would be a travesty to glva a Ph.D. In Computer 
Science to somebody who will only have the level of proficiency In Computer Science 
necessary for people who practice In libraries. Information centers, data archives 
and such Institutions. We feel that library or Information science generally, and 
computer science, are quite different disciplines. 

MR* tf. SCHVR (UK) : I will deal with four topics. First, why do we bother with edu- 

cation and training programs? As has been said before, there are qiilte a number of 
people who have no formal qualifications In Information work or llbrarlanahlp and 
yet are quite successful. 

Second, If we have training programs, what are thalr objectives? Some of the 
dogmatic statements heard this morning stem from the fact that the objectives. In 
the view of the person who stated them, are very specific, whereaa objectives which 
might exist In other environments are not taken Into account. 

Third, there are constraints. It la of no uae Juat comparing programs In dif- 
ferent countries because the conditions under which theaa programs had been developed 
are different. I am not trying to aay that no two programs can ba similar, but that 
we ought to know a little bit more when diacuaslng programs, about the constraints 
and Incentives under which theae programs operate. 

i 

Fourth, what criteria exlat for evaluating programs? Thera ought to ba soma sort 
of feedback on whether theae progrema are really useful. 

I feel that there are really two major objectives. First, thara are educational 
objectives which, so far aa I have aeen In the literature on educational training 
programs In thla field, seem to have been completely forgotten. Tha objective of an . 
educational program la educational. Thla nay aound a platitude, but It la absolutely 
eaaentlal. By educational I mean Bloom f a reasoning on education objectives. You will 
aee that thla takes Into account aome points Ilka knowledge, which Includes tha 're- 
call of facta and generalizations of methods and proceaaaa, patterns and atructuraa. 
Then one goea to tha much more difficult level of comprehension, publication, appli- 
cation, analyala, synthesis, and finally evaluation which Includaa Judgments. 

I am afraid that many programs seam to concentrate largely on tha knowledge part 
bacauaa It la vary aaay to teat this. It la very difficult to taat evaluation Judge- 
ment, because people. have got thalr biases and It la difficult to agraa that somebody 
has mads a good caao If one holds a diametrically opposed point of view, lowevar. It 
la the function of a teacher to be able to do this. So knowledge Is the thing which 
tends te be straaaedf and paopla era walking encyclicals. 

Sacoodly, thara are professional or utilitarian objactAvaa. Obviously peopls do 
not attend couraaa Juat for tha fun of It. Thare are four v^ajor types of utilitarian 
or professional objectives. Thara are thoae of tha indlvldu<\l concerned, there are 
academic ones, thoae of tha profession, and finally the national objectives* which 
era vary pronounced In a country like Ieraal. 

It a seme to ma that thara are aaaentlally four catagorlaa of work. First there 
la tha production and operation category. Thla raqulraa paopla who know something 
about tha aubjact they are working in - tha subject specialist. Thaaa are tha practl- 
tlonara. One alao requires system operators. Vary many librarians era Juat of that 
type. They know how to run tha library. 

Increasingly Important, I think, are people who are capabla of doing development. 
On a knowa of libraries which become foaaillzad. Development haa to ba continuous. 
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Applied* riMirch, which does not deal with the toplca which bring preatige to the 
individual! are the toplca which are eaaentlal to the country. Unless they are dealt 
with, the work won't gat dona. 

Background reaearch, which la slnoat pure research, appears to be essentially 
a cad sole. I do no think one can really prescribe what sort of thing ought to be re* 
searched because this la aonethlng which cones free the individual himself. All that 
can be done la to state the sort of problem which exist a, at a national level, and 
hope that a one people may. get interested. It la no use coming around and saying "you 
will work on this*" Reaearch la like the work of an artist. An artist does not nor* 
mally produce works which have been dictated to Mm, 

What are the criteria which ought to be considered? First, effectiveness; second, 
efficiency; and third, the benefit*to*cost ratio. Educational effectiveness: how ef- 
fective la the course? Professional effect iveneaa: la the output of the prograa use* 
ful? It it actually used? Do the people in fact get jobs? Efficiency la essentially 
the graduate-to-coet ratio* It la certainly not the output-to- input ratio. One could 
easily pass everyone end be 100% efficient, but quite ineffective, Benefit-to-coat is 
extremely difficult to assess. There ia a lot of discussion in educational circles 
about the benefits of education in general. 

PROF* M* KOCHSB (US): To Prof . Borko. Tony Etlnger of Havsrd has questioned whether 
it is dfeirablaU) have a separate computer science department; the same thing could 
be asked about information science. I wonder whit your views on this question are? 

PROF. BOBKO : To Prof . Kodhon. This depends upon your emphasis. We could emphasise 
it in our library schools; for example, UCLA gives a special degree in information 
science in their library school, and grants two separate degrees: Master in Library 
Sclaoce and an M,8c. in Information Science. UCLA will give degrees in information 
aclance within the Computer Science Department and the School of Management. My own 
view is that there is enough of a common body of knowledge in information science and 
particularly in the systems analysis aspect a to deserve a separate degree program. 

PROF* StSSIORS: To Prof* Borko . Logic does not always determine where a program la 
located. Sometimes it la a matter of political or administrative necessity. Let us 
be honest about why some programs are called by certain names and why they are OK 
where they are. There are some theoretical reasons, but there are also some pragmatic 

reasons, 

DM. V. ROTHSCHILD (loroBl) : To Prof , Borko , What are the undergraduate requlr«ents 
for the different programs? Can one enter one of the computer programs or informs* 
tlon science programs with a BA in the humanities? 

PROF . BOBKO: To Dr. Rothidhitd. This la a very important point. At the moment we 
see the level of technician as a terminal type of degree and the student will probably 
coma from mi equivalent of a high school education, maybe with some work experience 
end possibly some technical training. In Germany, this la the equivalent of a degree 
from the Deutsche Gesellschaft fuer Documentation, an institutional type of degree. 

For the level of Information Specialist, requiring a Masters Degree, there la an 
area of apecla Illation, The person who wants to work in science a should have a 
background in sciences. If he wants to work in education or linguistics, he should 
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certainly have a knowledge of more than one language; If he will emphasize management 
he would need a Bachelor's degree In management. He would also need, and this la a 
personal opinion, a fairly strong background In mathematics. These are the require- 
ments of my school. For the doctoral level, he will have to do research work and 
this would have to be Interdisciplinary. 

DR. M. KESSl£R (US): To Prof « Borko. ; This la not a question, but a request. Those 
In the profession who occupy themaelvea with matters of education should consider 
that the field is new and, therefore, by necessity, practitioners In the field have 
not gone through any formal education guided by our present understanding of what 
education ought to be. It Is fair to say that this condition will prevail throughout 
the next 5, 10, or 20 years. 

I would suggest that the people who give thought to the matter of education also 
concern themselves with the problems of education of the present practitioners whose 
circumstances are quite different from those of a student who attends claaaea from 
9 to 3. They work all day and have to take evening courses. I think that considering 
the state of our technology this educational problem la of great Importance and I 
would be very happy If the educational 1st a In this field would give some thought to 
It by organizing courses or even writing textbooks. 

PROF. BORKO: To Dr. Kessler. I am glad to aay, unequivocally, that people In all 
countries of the world are giving a great deal of attention to this and Indeed moat 
of the development haa come first from In-houas service courses, later from special 
technical schools and only then have they been adopted by the universities. Tha need 
to retrain and to continue the education haa grown out of tha technical lavel and 
there la still a major emphasis on this In all tha countries of tha world. 

PROF. H. ARMTZ (Gemanu) : I would like to add something from the point of view of 
the International Federation for Documentation (FID). We are In the procaaa of set- 
ting up a new program and new statutes and ws deliberated In Moscow whether to retain 
the word "documentation" In the nme of the organisation or to change It to something 
new. As this proved to be Impossible, ws decided to stick to the old name Interna- 
tional Federation for Documentation with tha understanding that "documentation" In 
the name meant Informatics, Dokumentatlon, Documentation, and Information Science, 
singular, but at the same time we agreed that our "documentation" was part of a bigger 
science called the Information Sciences, plural. Other sciences either belong to this 
group completely, e.g., library or archive science, or In part, applied linguistics, 
theory of communication, computer science (Informatics), and so on. 

PROF. GROSS: I think It la a tour de force to gat a form of Esperanto Into the name 

of an organisation; you seem to have succeeded In that! 

DR. ROTHSCHILD: I am glad to endorse at least two of the points which have bean made. 
First, the Importance of selection of trainees as well aa of teachers. I am extremely 
glad that Mrs. Vllentchuk pointed out the need for general background, not only In the 
limited field of science or technology, and I aa even ready to agree to the point she 
made concerning the desirable age. I like to have In ay claaaea at least some stu- 
dents who have accumulated experience In life, served In education, etc. 

The second point Is service versus research. Our universities here are rather 
conservative. Only next year I hope to get the right to grant Master's degrees. 

Even today members of the University Senate doubt If ours is a scientific profession, 
and It la especially difficult to recruit science graduates. 
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Before the opening carMony I had a talk with Dr. Penland and he acked ne how 
many llbrarlana from public librarian are at thla Conference. 1 had to answer - very 
few. But public llbrarlana alao must have basic knowledge about all that Is dealt 
with at thla Conferenca. Concerning the recruiting of science graduates, 1 should 
like to mention that Mr. Lsbowltx la more optimistic In thla reapect than 1. He In- 
tends to Interest ac lance graduates at the Technlon In joining a Master's program In 
library science. 

In the States there la a five year program without any relation between the Ba- 
chelor's program and the one year course In llbrarianshlp or Information science. 1 
think It would be desirable to shift some of the basic subjects to the undergraduate 
level without giving Bachelor's degrees In Information science and llbrarianshlp. 

PROF. AENTZ: Va sometimes simplify the problem If we do not .think of the persons 
whom we are going to train. What we really want are persons able to serve user pro- 
files. Thla means they must have experience In the fields In which they will serve 
thalr users. 

On the other hand a Mrs. Vllentchuk said that It Is difficult to recruit people 
over thirty. Thla la absolutely true as I have found many tines, but we want the 
bast chemist to serve a chemical user profile. This means that ws take him sway 
from chemical work where he could make discoveries and become famous. We want the 
best medical doctor for medical profiles. We take him sway from his medical work 
and so prevent him* from becoming a famous surgeon. Only the best people In the filed 
will make the bast information officer*. 

So vs must give a lot of Incentives to those whom we want aa scientific Informa- 
tion officers. Questions of pay, social status, general recognition of Information 
work, and so on srs Involved. And this In addition to all the problems of curricula. 

MR. R.B . ZAA1HAM (South Afriou) : To Prof. Amtx, In our experience, Information of- 
f leers are normally not people vho are extremely successful In research or who find 
rasssrch vary Interesting. It Is not those people who become good Information offl- 
cara. Good Information officers normally start off with a highly Idealletlc view of 
science and research and bacons disillusioned. They then turn to Information work 
In which they find mors Intellectual stimulation. 

1 think research Institutes which have funds available can make a study of the 
psychology of tha successful Information officer. At the moment one has to find 
theaa paopla by trial and error. There Is no way of actually selecting them accor- 
ding to psychological taata and one only lsama by experience which people srs likely 
to be successful and which are not. 

DR. BROW (CanacLx): I would like to confirm what Mr. Zaalman just said. In 

Jteada during che past six years vs have operated an In-training program sponsored 
by the National Science Library and a scholarship program, In an effort to encourage 
scientists to corns Into the field of science Information. We have attracted some 
very good people, but on the whole the applicants have tended to be people who ere 
not doing tee well as scientists and have decided that they have to get Into another 



Aa a result of developments In Canada, wa are now undertaking a vary detailed 
study of what type of training Information scientists require, Ws have a team headed 
by a professor from Dslhousls University which Is trying to find out what the training 



field 
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needs are 9 taking a look at the situation as It now exists In Canada, evaluating 
whether It Is adequate or not, and If not, what should be done about It. 1 suspect 
that the needs In Canada are no different than In other countries and the team's re- 
port, which will be out within the next three or four months, will be of Interest to 
other countries too. 

DR. A . COCKX (Belgiwt) : This discussion has emphasized the difference between the 

Anglo-Saxon approach and the European approach. The American approach over-emphaalzea 
the Information scientist as against the subject specialist. I cannot accept Prof. 
Borko's view that a Ph.D. In information science Is a qualification for work In a 
chemical Information center. 

Dr. Gezellus has mentioned that llbrarlea are creations of the nineteenth century 
and documentation centers are the creations of the first half of this century. When 
we enter the second half of this century, the dialogue In Information transfer la 
only possible when there are top qualified people on both ends* If you do not have 
this dialogue your Information center Is a complete failure. 

PROF. GROSS: I cannot think of the future of Information as a mechanism without 

highly trained people at the Ph.D. level. 

PROF* BORKO: To Dr. Cockx . I do not think there Is any disagreement at all. What I 
was trying to describe are the different needs within the broad flald of Information 
science, and the different levels of training necessary to meet these needs. 

I agree completely that for the service oriented degree there la no need for the 
Ph.D. It la necessary for a research and teaching career to advance the state of the 
art In communication, in Information analysis and dissemination. Whether the same 
person can do both, whether he will progress by experience from the service degree 
and then come back to the university because he has a real raaearch problem and wants 
to be able to pursue It, la a different matter. We should not make the mistake of 
trying to force all education Into a single mould. The field aaema to be developing 
Into various types of orientations and various levels of competence. 

Not everyone la going to go Into management and Information centers. It la.for 
the university to decide whether all those going Into llbrarlanahlp or Information 
work should have courses In coat accounting, in mathematics. In operations research. 
There are different levels of specialization. It would be a mistake to aay that a 
particular curriculum la the only curriculum. 

MR* ZAAIMAN: To Mrs* Sessions . You said in your paper that the Qity University re- 
quires those students who do costing to have quite a good background In mathematics 
and statistics. That Is very true if one works at a doctoral level, but I think that 
any librarian who wante to do coating would ba very lucky If he had the very special- 
ized' training In costing and statistics required. In most Instances the services of 
specialists In these fields could be called on. 

PROF. SESSIONS : We feel that a certain basic elemental understanding of coating and 

management Is essential to almost everybody at some point In his career. Perhaps nor 
for the first degree, but certainly later. I have been asked to chair a seminar of 
the Library Association on new developments In llbrarlanshlp, for the City University 
next spring. We have 276 llbrarlea at the university. We asked ourselves, once they 
know about new developments, how will they be able to apply then in their own libraries 
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and what will the effect be on the university library system? So we decided to make 
this Into a 2 year program. The first year Is devoted to matters that we feel almost 
everyone should know, but Is not aware of, and therefore, not really capable of making 
the kinds of decisions which should be made on the low level of the hie; archlal 
structure. 

PROF* ARNTZ: We in Europe were unhappy when the Americans changed documentation to in- 

formation science because the word scientist here takes on a different meaning from thar 
which it normally has. If Information science is a science, and we are convinced that 
it Is, then we need information scientists who are scientists in the field of informa- 
tion science. The man of whom I previously spoke - the best chemist who Is to serve 
user profiles in chemistry, the best medical doctor who has to do this in medicine - 
he, of course, Just as Dr. Cockx said, must have studied chemistry or medicine and In 
addition, after practical experience he must have studied Information science, but 
this la not his subject field. The tun who runs a big Information or documentation 
center, In a chemical plant, asked about his profession will always answer, ,? I am a 
chemist", but, of course, he la also an Information man. 

So we in Europe and in the International field distinguish between the real sci- 
entist In Information aclence whom, as I said, we badly need, and the man who serves 
user profiles. Ha has studied a particular subject but afterwards he certainly has 

studied Information science as well. 

• . 

MISS M* ATIR ( Israel) : We forgot one very Important thing and that Is the public. What 
la the Image of our profession from the point of view of the people we serve? I 
atudled at Columbia University in the States and when I started working in the New 
York Public Library people asked me - what is a librarian? Do you need to study to 
become a librarian? I hear the same question in Israel. The public does not know 
what our profession le. How can a student want to become a librarian if he does not 
know what It le? 

PROF* GROSS : I can talk aa one of the users. I am not a librarian. I use libraries. 
Research people learned very quickly that they waste an awful lot of effort in dupli- 
cation if their interaction with the information source is not adequate. 

MRS* E* EHRLI CH- DE VRIES (Iorael) : The overlapping disciplines demand an enormous 

amount of special knowledge from a librarian and I wonder whether in a special libra- 
rian^ training it would not be possible to have two yeara of general education in 
addition to the llbrarlanshlp subjects and in the third year specialization in the 
field in which the student would later like to u$rk. A degree in civil engineering for 
example, la not much help to somebody who will later worl*. aa a librarian in medicine. 

If, on the other hand, in the third or fourth year the student learns a special disci- 
pline, it might afterwards give him an enormous advantage in finding work. 

PROF* BORKO: To Mrs* Ehrlich* This suggestion is being followed now by a number of 
institutions in many countries. Let me speak for a moment about my own institution. 

We started with a one year Master's degree, and immediately recognized the problem 
that you raised, the need for specialization. A medical librarian needs different 
training at the specialization level than a military or a tare book librarian and ao 
a second year has been added, which is a sixth year to the whole program. 

It now 'includes a four year Bachelor's degree, which is the basic first degree; 
then the one year Master's degree in llbrarlanshlp; and then the sixth year for a 
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certificate of apeclallmatlon. low, UCLA felt that we should sake It wore general 
so we are beginning thlo jeer o 2-year Master's program* The first year of our 
Master's prograai will deal with general subjects In llbrsrlanshlps basic work In 
classification bibliography B reference and nanegensnt; and the second year, after 
the student already knows whet he sente to do upon graduation, with fpeclsllastlon* 
Many other schools ore considering this sane approach. 

• 

PROP. GROSS: TO Prof . Barko . It cones to alnd that with 6 years of training you ers 
getting pretty close to a nodical degree and 1 wonder why an Internship in an appro- 
priate library Is not Included. 

PROF. BOBKO: To Prof * Grom * • Our 2-ysar degr e e prograw doss Indeed Include a sinner 
Internship progran In the second year, the question Is whether It Is worthwhile, 
cost affective, to go to the extra year! Is there n lncrenent In aalary consensu- 
rafts with the coat of that trelalx*! At the non nit the thinking at our Uhlvarelty 
la that Jobe ere difficult to get and that a per eon with an extra year training 
would be wore readily selected for a Job, other things being equal, a^d we hope that 
this person will start at a higher salary. 

PflOF, SESSIONS z There Is, of course, a great deal to be aeld for the Idea that there 
ahould be specialised training in nodical librarlanahlp, engineering llbrarlanablp, 
etc. 1 have had a great deal of legal background, so 1 obvloualy took e course In 
legal bibliography and In the social sci en c e s , how ev er. In the 08 there la e great 
deal of sowing arouni fron one type of library to another. X think that we are 
doing a disservice to librariana by Uniting then to one type of library. If libra* 
rlanehlp end information science is e discipline In ltaalf , then there ere basic 
techniques In information science which can he applied no natter what the subject 
field la. 

W. SCRUB: 1 have In fact put this into ay paper when talking about professional 
objectives. In designing a progran consideration aunt be given to the career axpec* 
tat lone of the graduates, which nest not be woree than expectations of graduates In 
related flelda, in terns of financial reward, prestige, interest in the subject, etc. 

Ve neat avoid developing progran which lead to dead-end Jobe, however great the 
demand nay be for such- personnel. On the contrary, we should a In at providing an 
education for flexibility. The education for flexibility will not only increase the 
snploynant possibilities of the individual, but also by exposing him In the couree 
to conplately new topics, will b r oaden his outlook end hopefully increaee hie Interact 
In the subject. 

The point 1 an trying to neks is that these special purpose couraaa for lav 11* 
brer lane, for medical librarians, and others, which in fact do exist In eons school a 
In the 08, era highly undesirable as a first basic program. 1 an etreeelng this ee 
a flrat program. One does not train a toothpaste organic chni^it, but expecte hln to 
taka a general cheuiatry degree, which includes organic. Inorganic and physical chan* 
latry, and only then allows bin to specialise. Ve ought to Is doing the sane. People 
need a broad background. 

MRS* riLBTCBUX: The atatua and Image of oar profession ere at praeent not good, but 
I have many colleagues who's personal status la excellent. Vhyf Becauee atatua la 
something which you don't gat by resolutions or with a high salary, but through your 
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personal qualifications and the usefulness of your work to the community. And here 
again I come back to the Ph.D. problem. Ph.D. training is probably necessary - but 
for whom - for us, for the professionals, to advance our profession. It is not ne- 
cessary, however, at the moment, for our users* For our users what is necessary is 
the technical level - people who can give really useful service. 

The Ph.D. programs may be necessary later, and I come always back to the example 
of Lomonosoff. Somebody who really has something in himself and has something to 
contribute cannot be held down through any external circumstances. Lomonosoff was 
a Russian peasant who lived at the beginning of the 19th century when there still 
was serfdom in Russia. He walked by foot from Siberia to St. Petersburg motivated 
by his thirst for learning* Bom a serf, he ended up as the first President of the 
Academy of Sciences of Tzarist Russia. Why? Becauoe he had it in him. 

We do need people who will advance the profession and they will probably need 
Ph.D.'s, but if they have it in them they will do it without a Ph.D. too. But we 
must not build an upside-down pyramid. We must first improve the technicians, the 
engineers of our profession who will be immediately useful to the public and will 
help to improve our image and our status. 

MR, D, ELAZAR ( Israel} ; I would like to make a comment regarding specialization. I 
think we have not taken into consideration the overlapping in the sciences. You 
can't take an information officer and say - you are going to be an information of- 
ficer in chemistry - without understanding something about physics, mathematics and 
possibly geology and engineering. From my own experience I can say that the fact that 
I was able to work in a general science library helped me later immensely with 
specialized information work. 

MRS. F, CESTAC (France) : I would like to comment on Mr. Reichardt's paper. We train 

students to be information specialists and we have a two year program in information 
science. The first year is devoted to library oriented subjects and to computer 
work. We insist on a dialogue between the trainer and the trainee and Apply a me- 
thod of group dynamics. 

We require from the students special competence in a specific field, such as 
management or systems analysis. In the second year we teach system analysis and 
management, but we also have special training in semiology which nobody here mentioned 
though it is very important. 

I have no scientific background but rather a literary or artistic one, and I have 
noticed that often documentalists have no artistic feeling at all and know nothing 
about graphical layout. This is an important point and we have had a great demand 
for it in France. We have also started courses in oral communications. 

PROF. KAVLA : In library education much depends on what the library schools and 

teachers are able to do. A good curriculum with bad teachers cannot produce good 
librarians. We should have good teachers with good backgrounds in the subjects 
they teach. Normally when we recruit teachers we don f t ask whether they have work 
experience in a particular library or a specific documentation field. This should, 
however, be considered when the teachers are being recruited. 

I am pleased to hear that UCLA now has a two-year Master degree program. In 
India* we have had two-year Master degree programs since 1948. 
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W. S. SCHWARZ (Sweden) : As you know, Martin Luther said that the way to hell is 

paved with good intentions. At various Conferences, not the least so in futurology 
in which I am working now, I hear these good intentions have never materialized in 
really good propositions. 

Now the question of usefulness is quite interesting* We say, "Let us be more 
useful". But I am not sure that esteem is related to usefulness because for instance, 
a garbage collector is one of the most Useful persons in society and yet the status 
of the job is not that high. 

I believe that the relation between information and the structure of aclence and 
of scientific growth has not been really considered. If information could go into 
science policy, which can be easily done, by a science index method or other similar 
methods, we could show how useful we are and extend the field in a direction which 
can then be related to esteem. 



550 



ERIC 




▲ 



SESSION SEVEN 

publishing and reprography 



193 



r 



o 

ERIC 



dutfcare 0/ Papers 



Georg Thiele 



553 



M.D. Martin, 

P. Simmons and 
M« J« Vernon 



579 



E» Offenbacher 



559 



Melr Doron 



587 



Horst Zuchel 



567 



Lowell H. Hattery 



591 



Dieoueeione 



601 



isu: 



DEVELOPMENT OF SPECIAL METHODS TO PRODUCE 
MICROFICHE ECONOMICALLY IN SMALL 
QUANTITIES. 



Georg Thiele 

Arbeitss telle fiir Reprotechnik im Institut 
fiir Dokumentat ionswes en 
Frankfurt / Main 



SYNOPSIS 

The increasing flood of literature in science and tech- 
nology makes it necessary to find new ways for storing 
records and information. Large documentation centres have 
been using microfiche successfully for many years. The 
technical possibilities of using the advantages of micro- 
fiche for storing information even in limited quantities 
are explained. 



Mit steigenden Veroff entlichungszahlen besondors im wissonschaftlich- 
technischen Dereich ist das Ende des Zeitabs chni ttes gekommen in dem 
Papier die einzige Materie zur Fixierung und tibermi 1 1 lung von wissen- 
schaftlichen Mitteilungen war. Neue Speichermedien werden gesucht, 
fiir die zwei Voraus setzungen wichtig sind. Dei einer Speicherkapazi- 
tat, die erheblich hoher sein mufl, als die des iiblichen Druckes mufl 
ein schneller Zugriff gewahrt bleiben, odor verbessert werden. Die 
Effektivitat einer Veroff entlichung darf jedoch durch eine Veran- 
derung der Publikat ions form weder eingeschrankt , noch erheblich ver- 
touert werden. Normale Druckschrift oder Schr eibmasch inenschr ift 
laflt sich auf Papier hochstens urn 50% verkleinern urn noch lesbar, und 
was auch wichtig ist, kopierfahig zu bleiben. Magnet speicher und 
Mikrofilm bieton sich an, da ihre Kapazitbt erheblich hoher ist, wo- 
bei der Magnetspeicher zwar die geringero Kapazitat, aber den schnel- 
loren Zugriff bietet, der Mikrofilm dagegen die hohore Spoich orkapa- 
zitat. Es hat sich in der Dokumentation bisher als sinnvoll erwioson, 
in Magnetspeichern keine Gesamt information zu speichern sondorn hier 
Ausziige in Form von Schlagwortern , Daten und Literaturangaben fostzu- 
logen, wahrend es der Mikrovorf ilmung vorbohalten ist, Gesamtinfor- 
mationen, seien es Texto oder Dilder, die als Information in dor ori- 
ginalon Deschaf f enheit erhalten bleiben, zu speichern. Die vorlio- 
gende Information wird verkloinert und in analoger Form archiviort. 
Man kann in diosem sachlichon Zusammenhang doshalb auch von Analog- 
speichern sprochen. 

Desonders oingofiihrt hat sich hierboi das Microfiche in soinen inter- 
national gonormton Abmossungen von 105 x 1^8mm, mit oinom losbaron 
Kopf und fiinf Reihon von jo 12 Aufnahmon mit oinom Vorkloinorungs- 
faktor von ca. 20. Ein solchor Microfiche hat in dor ganz normalon 
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Ausfiihrung cine Spcichcrkapazi ia t von mindcstons 200 000 Zeichen* 
Starkore optische Vcrkl einerungen sind heutc bei onderen Vorfilmungs- 
arten boroits in dor Anwendung und konnten auch die Kapazitat des 
Microfiche noch erlieblich erhohen. Die derzeit interna tional o Form 
des Microfiche, die auch ciner ISO-Empf ehlung entspricht, ist abor 
boroits so woit verbreitet, daft man crwartcn kann, allororts Benutz- 
ungsgcrate, d«h. Losogorate oder Riickvergrofterungsgera t e , vorzufinden. 
Beispicl sweiso in den Vereinigten Staaten erfolgt die Informa tionsvor- 
toilung in Form diescr Microfiche durch den National Technical Infor- 
mation Service, woboi ein Tagesversand von 4 000 Microfiche als nor- 
mal angogebon wird, Es ist nah eli egpnd , daft auch kloinere Inf orma tions- 
zontralen sich fur das Microfiche als Inf orma tions trag or cntschcidcn 
mbchten und zwar nicht nur well die Lesegeriite an viclen Stellon be- 
roits vorhanden sind, sondern auch weil es moglich ist, kleinere 
Microfiche - Sorien in die groften Sanunlungen zu integrieren. Diese 
Absicht scheiterte bis jotzt abor daran , daft die auf dem Markt be- 
findlichen Microfiche - Aufnahmeg era to mit ihren sehr hohen Preisen 
fur kleinere Di ens tstell on vollig unrentabel sind. Diese Gerate lie- 
gen in einer Preisklasse zwischen$20 und 35 000,-. Unsere Uberlegung 
gingen daher davon nus , ein (iera tesystem zu schaffen, das stufenma- 
ftig der Menge der herzus t ellendcn Microfiche angepaftt werden kann. 

Zunnchst wurde s icherges t ellt , daft l6mm Mikrofilm - Au f nahmsgera te 
und zwar Schrit tschal tkameras auf dem Markt sind, die genau dem in- 
ternational genormten Microfiche - Schritt von 11,75mm entsprechen. 
Zusatzlich hat datm ein Hersteller fur Mikrofi lmgera te , die Mikro- 
film GmbH, eine Tochterge sellschaf t von Agfa - Gevaert, fur ihre 
35mm-Mikr ofilm Schrittkameras , die in vielen Bibliotheken und Doku- 
mentationss tellen stehen, eine aus weehs elbare Kassette fiir l6mm- 
Film mit einem festen Schritt von 11,75mm geschaffen. Hierdurch ist 
es intcressierten Diensts tellen moglich bei g«?ringen f inanzicllen 
Auf wand S treifenf ilme im Microfiche - Schritt herzus tellen • Die Fir- 
ma Kalle produzierte nach unscrcn Angaben einen Montage - Tisch, in 
dem der Film auf die Lange von 12 Aufnahmen geschnitten und festge- 
halten werden kann. Eine von der gleichen Firma hergestellt Acetat- 
Haftfolie im Format von 10,5 x l**,8cm, bei der die Schutzschicht 
s treif enweise entsprechend den Zcilen eines Microfiche abgezogen wer- 
den kann wird dann auf den Montage - Tisch gedriickt und nimmt die 
Filmstreifen auf. Der noch fehlende Titel in der ersten Zeile des 
Microf iche , kann von einer Schreibmaschinenvorlago durch eine Schnell- 
kopie mit Diffus ionsentwicklung oder mit Hilfe anderer reprographi- 
scher Verfahren hergestellt und auf den Kopf der Haftfolie aufgetra- 
gen werden. Von den so f ertigges t ellten Master fvir Microfiche konnen 
nun im bekannten Diazo — Kopi erverfahren beliebig viel Microfiche 
kopiert werden. Eine solche Heratellung von Microfiche entspricht der 
internationalen Normung , da hier ausdriicklich festgelegt ist, daft 
auch die Kopien von Montagen, wenn sie der Norm entsprechen, und in 
der Qualitat nicht von anderen Microfich ekopien abweichen, als 
Microfiche akzeptiert werden. 

Diese Aufteilung technischen Arbei tsgang es hat den Vorteil, daft man 
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bei geringem Bedarf mit der Beschaffung oines l6mm Mikrofilm - Auf- 
nahmegerates fiir ca. 1 500 Dollar bereits im eigenen Biiro den Film 
herstellen kann, ohne Unterlagen aus dem Hause geben zu miiften. Der 
belichtete Film kann im einfachsten Fall an eine Zentrale gegeben 
werden, die diesen Film entwickelt, den Microfiche montiert und 
auch die Kopien herstellt. In Deutschland ist die Arbei tss t elle fiir 
Reprotechnik eine solche Zentrale, die derartige Arbeiten fiir andere 
Institute durchfiihrt, die aber auch neben einem Dupliziergerat ei- 
nen Riickvergrofterungsautomat en zur Verfiigung hat, mit dem sie bei 
Bedarf auch Hardcopies von den bei ihr deponierten Microfiche her- 
stellen kann. Die einzelne Di ensts telle , die bisher nur den Film auf- 
genommen hat, kann dann bei steigendem Bedarf, durch Anschaffung der 
weiteren Gerate die Herstellung von Microfiche immer weiter, stufen- 
weise, in eigene Tatigkeit iibernehmen, ohne das vorher gekaufte Ge- 
rate iiberf liissig werden. Veitere Vorteile haben sich bei dieser Form 
der Microfiche - Herstellung ergeben, da bei spielsweise Microfiche, 
die nicht abgeschlossene Berichte enthalten , durch zusatzliche Mon- 
tage neuer Seiten auf den neuesten Stand gebracht werden konnen. Bei 
diesen Erganzungen kann jeweils ein Datum mit verfilmt werden, urn so 
den jeweiligen Stand der Arbeiten kontrollier en zu konnen* Ein wei- 
terer Vorteil besteht darin , daft auch Halbtonbilder besser als bis- 
her im Microfiche wiedergegeben werden konnen, denn es ist nun mog- 
lich, eine Serie Halbtonbilder auf entsprech endem normalen l6mm Ki- 
nofilm aufzunehmen und in den Microfiche einzufiigen* Dadurch ver- 
schwindet der bisher sehr grolie Qualitatsunt erschi ed zwischen der 
Schrift und der Bildwidergabe . 

In Deutschland wird dieses Verfahren zur Herstellung von Microfiche, 
weil es problemlos und relativ billig ist, in steigendem Mafte, be- 
sonders bei Instituten und Dokumentationsste linn mit einem kleineren 
Mengenbedarf benutzt* Es ist selbstverstandli ch, daft Dokumentations - 
zentralen mit einem groften Durchgang von Reports auch in Deutschland 
auf automatische Auf nahmegera t e zuruckgreif en , fiir die sich dann ja 
auch eine Rentabilitat ergibt* Die Verbreitung des Microfiche wird 
noch weiter zunehmen, da bereits die ersten Microfiche - Farbfilme 
produziert sind, was besonders fiir die Medizin sehr interessant ist 
und da eine spatere automatische Sortiermogli chkeit bei Microfiche 
aller Hers tellungsverf ahren durch nachtraglich angebrachte Code - 
Leisten in Zukunft moglich werden wird* 
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DEVELOPMENT OF SPECIAL 
METHODS TO PRODUCE MICROFICHE 
ECONOMICALLY IN SMALL QUANTITIES 

Georg Thiele 

Translated by the author from the 
German original 



We note that the number of scientific and technical publications is increasing. The 
period during which printed paper was the only material for transferring and storing 
scientific knowledge is now coming to an end. Normally printed or typescript charac- * 
ters can be reduced up to 5Q%* But this is the limit if the characters are to be 
legible and - this is also important - it should be possible to make copies from them* 
It has become necessary now to look for new media in which information can be stored. 
These materials must meet two essential requirements* 

1* The storage capacity must be much higher than that of printed matter, 

2. Rapid and comparatively easy access to the stored information must be ensured. 

Besides new media for storing, information should ideally not be less convenient 
than printed paper, and should also not be very much more expensive. 

Two new media for the storage of information meet these requirements: Magnetic 
memory and microfilm. 

The storage capacity of both media is much higher than that of printed paper. 
Comparing these two materlals 9 the storage capacity of microfilm is higher than that 
of magnetic tape* On the other hand, the access to data stored in a magnetic memory 
is much more rapid. Therefore, in the field of documentation it has been proved use* 
ful to store in magnetic memories (tapes or discs) only such data which is relevant 
for retrieval purposes (keywords , descriptors, bibliographic data, etc.) and to store 
total Information (reports, papers, tables, diagrams) on microfilm. In using this 
method the original paper is optically reduced and stored in analogous form. In this 
connection 1 would like to add that microfilm is an analogous memory* 

Microfiche in the internationally standardized size of 105 x 1&8 mm is widely 
used. It is built up of an eye-readable headline and 5 lines with 12 pages on each. 
The 12 pages in each line may be 12 single pictures or 6 pictures with double-f rames • 
The pictures have a reduction ratio of 1:20. The capacity of a standard microfiche 
is in the range of at least 200,000 characters. In other microfilm systems reduction 
factors higher than 1:20 are used, which makes it possible to increase the storage 
capacity of microfiche. The internationally accepted form of microfiche, however, 
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which also corresponds to an ISO-Recommendation is meanwhile being widely used* It 
appears that the necessary equipment, "readers” and"read printers" for the internation- 
ally accepted size of the microfiche are available everywhere. 

For example in the U.S* the National Technical Information Service is distributing 
information in the form of microfiche. On the average 4,000 microfiches are mailed by 
the National Technical Information Service everyday. It is obvious that other smaller 
information centers would also like to introduce microfiche for several reasons, as it 
is also possible to merge smaller series of microfiche into larger collections. In 
many cases the introduction of microfiche is prevented by the fact that the available 
raicro-fiche cameras are very expensive. The prices vary between 20,000 and 35,000 U.S, 
dollars. In many cases this equipment cannot be used economically in smaller informa- 
tion centers. It was our intention to develop equipment for the production of micro- 
fiche, which can be adapted step by step to the number of microfiche to be produced. 

At first it had to be ensured that 16 mm Microfilm cameras, flow cameras and pla- 
netary cameras were available which could take pictures in steps of 11.75 ram. This 
correspond exactly to the internationally standardized Microfiche step. 

One manufacturer of microfilm equipment, Microfilm GmbH, developed an inter-change- 
able magazine for 16 mm film which has a fixed step of 11.75 mm. This magazine has been 
designed for Microfiche’s 35 mm planetary cameras, which are in use in many libraries 
and information centers. So it is possible to produce microfilm 16 mm with microfiche- 
step, According to our specifications the firm of Kalle has developed a special 
mounter-table on which the film is cut into 12 exposure strips and the strips are then 
fixed in the mounter. The same firm has developed an acetate self-adhesive transparent 
film in fiche size, i.e. 10.5 x 14.8 cm. The fiche film is then pressed on to the 
mounting table and the film strips adhere to the transparent fiche film. The title 
of the microfiche can be copied from a typescript original using a Diffusion Transfer 
process or by other reprographic methods. This film strip with the title is then also 
mounted on the self-adhesive film. From this master a microfiche can now be produced, 
by using a diazo copying method. Microfiche made in the way just described is in 
accordance with international standards. It is said that mounted microfiche duplicates 
will be accepted as valid microfiche as long as they conform to the relevant interna- 
tional standards, and if they are of the same quality as other microfiche-copies. 

The splitting up of the technical process for producing microfiche in sub-processes 
offers great advantages. If microfiche are only produced on a small scale, a 16 mm 
camera is sufficient. The price of such a camera is around $1,500. Microfilm can be 
produced at the office and there is no need to send out the originals. In the simplest 
case, the microfilm is sent after exposure to a service bureau which develops the micro- 
film, mounts the microfiche master and produces the copies. 

In the Federal Republic of Germany, such services are run by the Arbeitsstelle fur 
Reprotechnik at Frankfurt. In addition, there is also an automatic enlarger which 
produces readable hard copies of the microfiche. 

But if microfiche is later to be produced on a larger scale, the office can purchase 
the additional equipment which is needed to undertake all the production processes on 
its own premises without having to discard the already available equipment. There is 
another advantage in utilizing this way of producing microfiche. Far instance, a micro- 
fiche of a report, which has not been finished can be brought up to date by adding 
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pages. In this case, the date can also be put on the microfiche so that the progress 
made In the work can be checked at any time. A further advantage Is that In the most 
widely used negative microfiche, the continuous tone reproductions also appear In ne- 
gative form. Now these pictures can be produced In a better way. It Is now possible 
to mount 16 mm diapositives In the master microfiche. So there is no longer the dif- 
ference in quality between the reproduction of text and pictures. 

In Germany, this method of producing microfiche is being used more and more, because 
it is easy to operate and comparatively inexpensive. It is especially widely used in 
research institutes and information centers which are using microfiche on a relatively 
smaller scale. It is obvious that information centers which handle reports on a large 
scale will use automatic equipment which will then be more economic. There is no 
doubt that microfiche will be used more and more as recently it was shown that 
colour-microfiche could be also produced. This may be of special importance in the 
field of medicine. In Germany the first periodicals in the area of natural sciences 
will be published in the form of microfiche. For other purposes special code-strips 
are attached to the microfiche duplicate. This will also have the effect of increas- 
ing the use of microfilm. 
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SYNOPSIS 

Conclusions about reprography in information services were reached 
in industrialized countries* They are not necessarily true for 
young nations* Differences are discussed concerning gathering of 
information and its dissemination • Economic considerations are 
stressed* Local technical and organizational limitations are taken 
into account. Training abroad is recommended* 

Reprography, Information Services and Developing Countries are in themselves terms 
extensively considered in the literature. Much practical experience has also been 
gained concerning reprography's contribution to information services* For obvious 
reasons, theoretical as well as practical, conclusions have been reached in this 
field in the leading countries which happen to be the highly industrialized ones. 

No direct conclusion can necessarily be drawn from their experience for the benefit 
of the young nations. The foremost among the reasons would be the considerable 
difference in economic development and resulting possibilities. Organizational and 
administrational limitations in developing countries also make it often difficult to 
take full advantage of the reprographic techniques available for information centres. 
Commonly, their services are supposed to include the gathering of information, its 
processing and dissemination. In this context, it will have to be considered which of 
the existing reprographic techniques could best be applied for the benefit* of 
developing countries. 

Gathering information in a developing country most often means receiving desired 
particulars from abroad. Concerning the text of whole books and serials, the original 
publication is still the most desireable one, even where micro-publications might be 
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available. Microcards may serve as a typical point in case* Some years ago, their 
distribution was rather popular for some time, paricularly from various American 
channels. Considering the great need for specialized information, such as atomic 
energy reports, medical research papers and similar publications for the United States 
and other industrialized countries, their libraries and information services had to be 
equipped with special microcard readers for that purpose. The organizational changes 
and capital investment involved were certainly worth it, considering the large number 
of microcards required within a short time. After a while, this particular technique 
proved to be not really satisfactory and was largely replaced by the transparent 
microfiche. In this way, micro card-readers rapidly became obsolete, not without having 
served their purpose while it lasted. Developing countries, however, had caught up 
with the microcard technique only at a later stage and to a ouch lesaer extent, hereby 
channelling capital investment into a doubtful economic undertaking. 

In spite of that hardly avoidable experience, microcopies prove a most useful tool 
for gathering information from irregular sources. This is especially true for develop- 
ing countries, the literary resources of which are mostly limited as far a a scien- 
tific and technical information is concerned. Usually, they also happen to be 
situated at a geographical distance from the main centres of information. They there- 
fore have to rely on air-mailed communications incurring considerable postal charges. 
In these cases, microcopies prove an ideal medium. Learning the lesson, the inform- 
ation centres of developing countries ought to be careful in evaluating microfiche 
and its latest stages: Supers and Ultrafiche for their purpose. Tne whole of this 
field is still in a rather fluid state of developnent. Besides, here again, a spe- 
cific consideration, typical for developing countries, should be mentioned. It ap- 
\ pears, namely, that microfiche readers can be marketed at a much more attractive 

? price v ,han the usual afcrofilm readers. The price of these microfiche readers proves 

' reasonable indeed in relation to the high economic standard of the countries of 

j origin. In consequence, the microfiche system becomes rapidly popular there. The 

\ same reader, however, becomes rather expensive when marketed in countries importing 

their equipment with heavy duties attached, in addition to freight and insurance 
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charges involved. The considerably enlarged price must now be related to the economic 
standard prevailing at the readers destination which, in the case of one of the 
developing countries, will compare unfavourably with that of the country of origin. 
Even for that reason alone, microfiche cannot be expected to become as popular in the 
developing countries as in the industrialized ones in the foreseeable future, not- 
withstanding its particular advantage for information services as such. 

With due respect, therefore, to the care needed in choosing appropriate repro- 
graphic media for gathering information in developing countries, the advantage of 
microcopies cannot be overstressed. At the same time, it ought to be realized that 
these copies have in the first place always been looked upon as a medium to be 
scanned for the importance of its contents. Microfilms are not supposed to be read 
for any length of time. Instead, enlargements should be requested of frames of 
particular interest which, by way of scanning the film, appeared to be important. 
Microcopies as such remain the cheapest means for the acquisition of information and 
its storage,- more specifically for developing countries. Portions of particular 
interest, however, should be made available to the information centre* s customer, 
where possible, by way of paper enlargements. 

For processing of information and its retrieval, microcopies prove also a medium of 
advantage, e.g. where the classification of the information can be included in the 
microcopy at the time of its production. Once more, the microfiche has a good point 
in case. Much of the information required for retrieval can be included at the head 
of the fiche, where it can be read by the naked eye. Such a fiche, put in an envelope 
for its protection, can actually serve as a kind of index-card and source of inform- 
ationat one and the same time; because, once taken out of the card-file, the fiche 
serves as the actual material one has been looking for. Here again, information 
services operating in developing countries may find themselves in a less fortunate 
state, except where they practically act as the producer of the microcopy involved. 
However, where microcopies are received from abroad without classification, alter- 
native solutions may be available. If the copy arrives in the foim of rollfilra, 
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the lengthy material may be cut into strips, respectively of 10 single or 5 double 
frames* A heading is attached above each strip, which has been written by typewriter 
and therefore is readable with the naked eye* Formerly, for this kind of strip with 
heading attached, special envelopes had been readiliy available* Unfortunately, it 
arpears that they have gone somewhat out of fashion; but they may still be a solution 
for developing countries* An improved and mechanized version of this technique can be 
found in Kodak's "Micros trip 11 • Thi3 system, in turn, is largely overshadowed nowadays 
by the more sophisticated systems of data processing and storing* In most cases, the 
latter ones will prove, though useful, much too expensive for the limited possibili- 
ties of developing countries as against their extensive use in industrialized ones, 
for which these complicated and costly systems were originally conceived* 

Generally speaking, the dissemination of information should preferably be handled by 
way of paper copies* These are produced from original sources available at the in- 
formation centre* No real difference exists between the techniques used for that pur- 
pose in developing or industrialized countries* The great number of copier models, 
their materials and prices, however, make it essential for information services in 
developing countries to choose the appropriate product more carefully. Developing 
countries, usually being limited by their resources of foreign currency, even may 
have to give preference to techniques based on materials locally available, including 
advanced applications for diazo copying* In any case, the number of copies needed in 
the foreseeable future should be taken as basis of calculation for investment in a 
particular product; and although each of these models may be capable to serve for 
many years to come, another typical difference between industrialized and developing 
countries exists* In the former ones, the chosen model probably returns the invest- 
ment through its use during two or three years* By that time, a more advanced model 
- or possibly new technique - may become available which would make it desirable to 
change the existing equipment, though still usable, against new one. Such a change 
would be economically sound* In a developing country, rentability may be less obvious 
after the same period* In that case, u change of model might still be desirable but 
without economic justification. It is for these reasons, that the choice of the 
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appropriate copier must be made much more carefully in the case of developing count- 
ries*. 



Similar cons ide rat ions exist where a large number of copies from the same document 
has to be diiributed. These copies may be produced either by one of the duplicating 
processes, such as small-offset printing and spirit duplicating, or by making use of 
an additional counter at an automatic Xerox copier which makes it possible to photo- 
graph, aa from the sixth copy, at a considerably reduced price. Weighing these and 
similar techniques may become of decisive importance for the economics of information 
services in developing countries*, because, even economizing in a small way may, in 
the long run, produce important results. Limited editions of, say ten to fifty copies 
are typical for this kind of work. Prom there onwards, small-offset printing probably 
remains the only solution up to the stage where full printing processes become 
justified. 

Where small-offset printing has been chosen for duplicating, a decision about pre- 
paring the offset master must yet be taken. Specialized, high quality equipment can 
be bought for that purpose. In developing countries, however, one may want to take 
advantage of electrophotographic copying machines already existing, in order to 
produce masters in a cheaper way with satisfactory results. Actually, the whole of 
these dissemination problems requires not only reprographic knowledge and experience 
but also an understanding of distribution matters and publishing of limited editions 
which are so typical for the requirements of information services. In industrialized 
countries, these centres of information may consider as their foremost duty to answer 
requests received from interested parties. In developing countries, the same centres 
may have to go out of their way in order to stimulate local industry, for instance, 
into testing new possibilities gathered from publications abroad. Suggestions of this 
kind may have to be sent at random to all concerned. The distribution of a few dozens 
of copies is therefore typical for the needs of developing countries. 

Considerations like shortage of foreign currency may stimulate the setting-up of 
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local factories for the production of certain reprographic material and, possibly,', 
ever* some of the simpler models of equipment • Progress made in this direction in 
India, for instance, could serve as a guide* Nonetheless, careful economic calcul- 
ations should show if local manufacturing would indeed prove of advantage. On the 
other hand, a case such as the local production of reasonably priced microfiche 
readers could become almost decisive for the introduction of new reprographic tech- 
niques in the service of information as pointed out earlier. Instead of designing 
basically new equipment, arrangements with manufacturers in industrialized countries 
by payment of royalties may be preferable. Even assembling parts manufactured abroad 
may bring about some valuable savings in foreign currency. Unfortunately, developing 
countries are usually also those which suffer from lack of the necessary number of 
technicians. This shortage makes itself felt even in the proper handling and upkeep 
of the more sophisticated equipment, as required by information centres* It must be 
realized that, although the famous slogan "Any office girl can do it" may basically 
be true, it is the well trained and experienced technician who must be available to 
the untrained personnel in order to overcome unavoidable short stoppages in the smooth 
running of the machines. This technical point becomes more apparent in developing 
countries, as the manufacturer’s own technicians or those of his representative may 
be stationed far away. Any information centre based on reprographic services without 
the assurance of adequate technical help will certainly find itself limited in dis- 
charging its duties. 

It would be wrong, however, to consider the utilization of reprographic methods from 
a technical point of view only. After all, information services are run also in a 
certain organizational and administrative framework. This framework in itself must 
be in a certain relation to the country's general organizational capabilities and 
its administrative network* These again may yet be less developed among the younger 
nations* For that reason, reprographic techniques which satisfied in the framework 
which they were intended for, may not be applicable for developing countries. For 
the solution of this kind of problem a local trainee ought to be sent abroad, prefer- 
ably to a smaller one of the industrialized countries. On his return, tho by now 
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truined professional, experienced in the local difficulties of his native country, 
will become the appropriate person to find the desired solution* Jr. any case, so- 
lutions - although baaed on the experience gained among industrialized nations - have 
to be adjusted to local habits and possibilities* Prestige requests for sophisticated 
equipment mU3t also be avoided as long as its true application cannot be utilized* 
Modern reprographic equipnent, often baaed on ehitronic machinery, i3 typical for 
this kind of consideration* 

With due care, however, reprographic means will prove a powerful aid to 
running modern information services even in developing countries* With reprography’s 
aid, centres of information can play their proper role in the international network 
of information services* Through thin cooperation,- not only developing countries, 
but the industrialized nations too, may gain important advantages* It is well known 
that experience gained in developing countries may be of significance even for the 
industrialized nations as far as similar physical conditions, for instance, exist* 
The language used in announcing the results of these experiments may not be known 
outside their country of publication* Information services would then be looked to 
for translating the main features of these results for the benefit of interested 
parties abroad a"d for disseminating them accordingly* In such a connection, repro- 
graphy will prove a decisive medium for international cooperation* 
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SYNOPSIS 

The limited font of characters of the used 
>' machines is one of the main problems in the 
field of non-numerical data processing. The 
• example of the production of the "Deutsche 

'•» Bibliographic" should shown, how the Center 

for Mechanisation in Documentation has 
solved this problem. 

Im Jahre 1964 begann man bei der Deutschen Bibliothek in 
Frankfurt (Main) damit, die Hoglichkeiten des Einsatzes 
der elektronischen Datenverarbeitung bei der Herstellung. 
der Deutschen Bibliographic su priifen. Die standig stei- . 
gende Anzahl von Veroffentlichungen und der Personalman- 
gel erforderten geeignete Hafinahmen, um den Erstellungs- 
zeitraum der Bibliographic, der damals z.B. fiir ein Halb- 
jahresverzeichnis bei 16 bis 18 Monaten lag zu verkiirzen. 
Die Zentralstelle fiir maschinelle Dokumentation(ZMD) ent- 
wickelte ein System, welches alia Arbeiten, die zwischen 
der Titelaufnahme und dem Druck samtlicher Verzeichnisse 
und Register liegen, von einem Computer ausfiihren lieB. 
Als output lieferte dieser Computer einen TTS-Lochstrei- 
fen zur Steuerung von Bleisetzmaschinen. 

So erschien am 6. Januar 1966 als erste in der Welt eine 
National-Bibliographie , die mit Hilfe eines Computers 
erstellt worden war. Damit war das grundsatzliche Problem 
gelost, Bibliographien, Register, Kataloge u.a. in typo- 
graphisch ansprechender Form maschinell herzustellen. 
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.,erm dieses erstmals praktizierte System nicht frei war 
von hiingeln, so v/aren diose Mangel in erster Linie zuriick- 
zufiihren auf gewisse Unzuliinglichkeiten der eingesetzten 
Naschinen* 

jas zeigte sich schon bei den fur die Datenerfassung ver- 
wendeten Lochstreifen-Gchreibmaschinen, die bei 44 Tasten 
in Groft- und Ivleinumschaltung lediglich die direkte Dar- 
stellung von maximal 88 Zeichen zulassen. Bei einem der- 
urt geringen Zeichenvorrat ist vielleicbt noch die Her- 
stellung einfachster Kataloge moglich, nicht aber die 
Herstellung einer National-Bibliographie • Das Problem des 
begrenzten Zeichenvorrats stellte sich auch bei der eigent- 
lichen Datanverarbeitung, denn der benutzte Computer 
IBh 1460 arbeitet im BCD-Code* Im BCD-Code sind nur 64 
Zeichen direkt darstellbar* 

Den gestellten Ansprilchen nicht gerecht wurde auch die 
fiir die Ausgabe eingesetzte Bleisetzmaschine LINOQUICK. 
ills nachteilig wurden insbesondere die im Verhaltnis zur 
Verarbeitungsgeschwindigkeit des Computers geringe Setz- 
geschwindigkeit , die unzureichenden Moglichkeiten der 
typographischen Gestaltung und die relative Haufigkeit 
von Setzfehlern empfunden* Gchlieftlich waren auch Silben- 
trennung, Fahnenkorrektur und Umbruch durch den Computer 
noch nicht moglich. 

Deshalb entwickelte die ZMD in enger Zu3ammenarbeit mit 
der Deutschen Bibliothek ein verbessertes oystem fiir den 
Computersatz* Die Verbesserungen reichen von einer mehr 
analytischen Datenerfassung iiber programmtechnische Ver- 
feinerungen und Erweiterungen bis hin zum Einsatz einer 
ungleich leistungsfahigeren Photosetzmaschine* 

Die Grundkonzeption dieses verbesserten Systems mochte 
ich Ihnen im folgenden kurz darlegen* Dabei lassen Sie 
mich Ihnen zunachst den organisatorischen Ablauf des Pro- 
jektes erlautern, urn anschlieftend zu zeigen, wie die 
beschriebenen Mangel und Nachteile des alten Systems 
beseitigt wurden* 

Die Erfassung der Baton erfolgt auf 8-Kanal-Lochstreif en* 
Diese Daten werden off-line mit Hilfe eines Konverters 
zuerst auf ein Magnetband iiber bragen, um die Eingabege- 
schwindigkeit in den Computer zu erhohen* 




Computer* Typesetting 



Bevor mit der eigentlichen Verarbeitung begonnen werden 
kann, muB das Material bestimmten Gultigkeits- und 
Plausibilitatskontrollen unterzogen werden. Die vom Prlif- 
programm festgestellten Fehler werden mit einem Ilorrektur- 
programm beseitigt. Dieser Zyklus wird solange durchlaufen, 
bis das Material fehlerfrei ist. 

In der nachsten Phase wird das Material fur die weitere 
Verarbeitung aufbereitet. Das bedeutet vor allem Zeichen- 
umwandlungen, Textumsetzungen (z.B. Herausziehen von Autoren, 
Deskriptoren u.a. fiir die Erstellung von Hegistern) , 
Sortieraufbereitungen und anschlieBendes Sortieren oder 
Mischen der Daten. 

In der Ausgabephase schlieBlich werden die Daten fiir die 
Setzmaschine zusammengestellt « Das heiBt: Zeilenaufberei- 
tung mit Silbentrennung, Umsetzung des Computer-Codes 
in den Code der Setzmaschine und Einfugen der Steuerzeichen 
fiir den Photosatz. 

Das Material wird dann abgesetzt auf Photopapier. Jede 
Zeile ist in dieser Phase noch mit einer laufenden Nummer 
versehen. Anhand dieser Nummer wird in einem weiteren 
Programmlauf die Pahnenkorrektur durchgef ixhrt • Sind die 
Daten fehlerfrei, erfolgt der Umbruch und schlieBlich 
der Satz des umbrochenen Materials auf Film. Damit endet 
die Verarbeitungskette durch Computer bzw. Setzmaschine. 

Folgende Maschinen und Gerate werden verwendet: 

Fiir die Datenerf assung wird eine Lochstreifenschreibmaschine 
vom Typ VONAMATIC benutzt; die Konvertierung vom Lochstreifen 
auf Magnetband erfolgt auf einem Konverter GIER 3000. Der 
eingesetzte Computer ist eine IBM 1460 (16K) mit ange- 
schlossenen 6 Magnetbandeinheiten, 5 Magnetplatteneinheiten, 
Schnelldrucker und Kartenabfiihl- und -stanzeinheit. Fiir 
die Silbentrennung und die Umsetzung in den Code der Setz- 
maschine wird eine IBM 1130 (16K) verwendet. Die Setz- 
maschine schlieBlich ist eine Photosetzmaschine vom Typ 
LINOTYPE 505. 

Nachdem auf der Ausgabeseite eine Maschine zur Verfiigung 
stand, die bei einem Zeichenvorrat von rund 1000 Zeichen 
wohl alien Anforderungen gerecht wird, gait es in erster 
Linie das Problem des begrenzten Zeichenvorrat s bei der 
Datenerf assung und -verarbeitung zu losen. Die bereits 
erwahnte Lochstreifenschreibmaschine hat einen Zeichen- 
vorrat von 88 Zeichen. Sicherlich gibt es Erf assungsgeriite, 
mit denen mehr Zeichen darstellbar sind. Es gibt aber 
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bisher kcin Gerat, ir.it dem alle gewiinschten Zeichen zu 
erfassen wiiren. Auf der andoren Seite ist ein erhohter 
Zeichenvorrat bei der Srfassung nur dann sinnvoll, wenn 
diese Zeichen auch zu verarbeiten sind. Das ist direkt 
bei deni Zeichenvorrat von 64 Zeichen der IBM 1460 nicht 
moglich. .iuch der Sinsatz etua einer Byte-Iiaschine wiirde 
dieses Problem nicht losen, denn hier liegt die Begren- 
zung bei 256 Zeichen. Dieser Zeichenvorrat wiirde zwar 
fur die Herstellung der Deutschen 3ibliographie ausreichen. 

Da aber das Problem des begrenzten Zeichenvorrats ein 
generellcs l-'roblen in der nicht-numerischen Datenverarbei- 
tung ist, d.h. die iiaschinen lassen die direkte Verarbei- 
tung aller gevninschten Zeichen nicht zu, muBten andere 
;iege beschritten werden. 

Sine generelle Losung dieses Problems hat die ZIID mit der 
Entwicklung der sogenannten Protypen gefunden. Protypen 
sind provisorische Typen, die nicht vorhandene Zeichen 
ersetzen und erst bei der Ausgabe durch die Getzmaschine 
auf die urspriinglichen Zeichen zuriickgefuhrt werden. 

Dadurch ist es moglich, alle gewiinschten Zeichen ohne 
Inf ormationsverlust zu erfassen und zu verarbeiten, auch 
wenn die verwendeten Iiaschinen nur einen begrenzten Zeichen- 
vorrat haben. Das Prinzip, bestimmte nicht vorhandene 
Zeichen durch vorhandene Zeichen zu ersetzen, ist im Drucke- 
reiwesen schon lange bekannt. Die ZhD hat dieses Prinzip 
erweitert und in eine systenatische Ordnung gebracht. 

Das hat die Verarbeitung dieser Zeichen im Computer 
erleichtert bzw. uberhaupt erst ermoglicht. 

Doch wie sehen diese Protypen aus? wie sind sie strukturiert? 

Eine Protype besteht grundsatzlich aus drei Zeichen: einem 
Grundzeichen (Grundbuchstabe Oder Grundziffer) und zwei 
Folgeziffern. Zur Ilennzeichnung dieser drei Zeichen als 
Protype' wird ein Gteuerzeichen (z.B. 8) vorangestellt . 

Ein nachfolgendes Steuerzeichen ist nicht erf orderlich, 
da eine Protype immer dreistellig ist. 

Wir kennen zwei Arten von Protypen: 

1. Protypen zur Darotellung von Sonderbuchstaben. 

Die Protype besteht aus einem Grundbuchstaben und 
zwei Folgeziffern; 

z.B. Gonderbuchstabe 6 wird dargestellt als 




~ * Gteuerzeichen 
•j ■ ■ -►Grundbuchstabe 
— — ►Folgeziffern 
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2. Protypen zur Darstellung von Sonderseichen. 

Die Protype besteht aus einer Grundziffer und zwei 
Folgeziffern; 

z.B. Sonderzeichen % wird dargestellt als 
8364 

TT_ 

► Gteuerzeichen 

►Grundziffer 

► Folgeziffern 

Sine vollstandige Liste aller Protypen finden Sie in der 
Verof f entlichung der ZMD: n Typen und Protypen fUr den 
Fotosatz mit der LINOTRON 505” • Hrsg.: ZMD (ZKD-A-22). 
Berlin, Koln, Frankfurt ( Main ) : Beuth-Vertrieb GmbH, 1970, 
30 S. 

Die Vorteile dieser Form der Zeichenverschliisselung sind: 

1. Zur Darstellung der Protypen werden lediglich die 
26 Buchstaben des Alphabets und die 10 Ziff era 
benotigt. Das bedeutet: Protypen sind mit jeder 
normalen Lochstreifenschreibmaschine zu erfassen. 

2. Das System der Protypen ist offen, d.h. gewiinschte 
Erganzungen durch den Benutzer sind moglich. 

3. Die Struktur der Protypen ist immer gleich. Das 
erleichtert das Ablochen bei der Erfassung und 
macht, insbesondere durch die Verwendung des 
Grundbuchstabens, die Protypen lesbar. 

4. Protypen sind sortierfahig. 

5. Alle Protypen sind ausnahraslos auch als Indizes 
darstellbar. Zu ihrer Kennzeichnung ist lediglich 
ein zusatzliches Steuerzeichen notwendig (z.B. 
Dber- Oder Unterstreichungszeichen) • 

Diesen Vorteilen steht als Nachteil eine quantitative 
Erweiterung des zu erfassenden Materials gegentiber, denn 
es sind zur Darstellung eines Zeichens vier Zeichen abzu- 
lochen. Doch wird man diesen Nachteil sicherlich in Kauf 
nehmen, wenn es darura geht, alle Daten ohne Inf ormations- 
verlust zu erfassen und zu verarbeiten. 

Nachdem das Problem der Vollstandigkeit bei der Erfassung 
der Daten durch die Verwendung von Protypen gelost war, 
gait es, das Material so zu erfassen, daft ein weitgehend 
direkter Zugriff zu den einzelnen Daten moglich ist. 

Dazu ist es notwendig, dieoe Einzoldaten zu kennzeichnen. 
Eine Kennzeichnung der Daten ist grundsatzlich moglich 
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durch Steuerzeichen oder durch die Verwendung von Kate- 
gorien. Uie Deutsche Bibliothek hat ein Erf as sungs schema 
erarbeitet, das die Benutzung von 99 unterschiedlichen 
Eategorien zulaBt. In der ersten Phase der Verarheitung 
durch den Computer werden zunachst die Adressen dieser 
Kategorien (Adresse = Gtelle innerhalh der Titelaufnahme) 
in einem AdreBbuch zusammengestellt . Bei einem derart 
analytischen Erf assungsschema ist dann \iber das AdreBbuch 
ein direkter Zugriff zu den Daten moglich. 

Die programmtechnischen Verbesserungen des neuen Systems 
beziehen sich in erster Linie auf die Ausgabephase , in 
der die Daten fur den eigentlichen Computersatz auf der 
LINOTRON 505 zusammengestellt werden. Es war das Ziel, 
die beim Bleisatz notwendigen manuellen Eingriffe weitest- 
gehend auszuschalten und die Moglichkeiten der Photosetz- 
maschine voll auszunutzen. 

Als Vorteile dieser neuen Konzeption seien genannt: 



1. Hohere Setzgeschwindigkeit 

Eine Aussage iiber die tatsachliche Geschwindigkeit 
einer Setzmaschine fallt schwer ? denn die Setz- 
geschwindigkeit ist u.a. abhangig von der Anzahl 
der belegten Hagazine(Bleisatz) bzw. Rahmen(Photo- 
satz), der SchriftgroBe und der Zeilenlange. Geht 
man davon aus, daB nur ein Magazin bzw. Rahmen 
belegt ist, daB die SchriftgroBe 8 Punkt betragt 
und die Zeilenlange 50 Zeichen nicht uberschreitet , 
so setzt nach unseren Erfahrungen die Bleisetz- 
maschine LINOQUICK ca. 5 bis 5 Zeichen und die 
Photosetzmaschine LINOTRON 505 ca. 70 bis 80 Zeichen 
pro Sekunde. Beim Photosatz wird also etwa die 
20-fache Setzgeschwindigkeit erreicht. 

Dadurch ist die Diskrepanz in der Geschwindigkeit 
zwischen dem Computer und der Setzmaschine zwar 
nicht beseitigt, aber doch verringert. AuBerdem 
wird der Computer zeitlich weniger belastet, denn 
statt des bisherigen TTS-Lochstreifens wird ietzt 
ein Nagnetband zur Steuerung der Setzmaschine 
erstellt. 

2. Bessere typographische Gestaltung 

Auf der Bleisetzmaschine LINOQUICK wurde nur in 
einer SchriftgroBe gesetzt. Das fiihrte zu einer 
gewissen Eintonigkeit des Schriftbildea. Die 
LINOTRON 505 kennt 15 unterschiedliche Schriftgro- 
Ben. Per Programm werden die gewiinschten Schrift- 



groBen fiir Uberschriften, Text, Durchschiisae, 
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FuBnoten, Nebeneintragungen u.a. vergeben und der 
LINOTRON 505 uber Steuerbefehle mitgeteilt. /iuch 
wurde eine Verbesserung des Schriftbildes durch 
eine maschinell durchgefiibrte Silbentrennung 
erreicht . 

3. Ausachaltung manueller Eingriffe 

Manuelle Eingriffe an der Bleisetzmaschine sind 
immer dann notwendig, wenn ein Zeichen gesetzt 
werden soli, das nicht erf asst v/erden konnte. 

Entweder die Maschine lauft auf einen Stop und 
der Setzer muB das gewiinschte Zeichen in die Maschine 
eintasten oder das fehlende Zeichen wird nachtrag- 
lich hineinkorrigiert. Da nur in einer Schriftgrosse 
abgesetzt vmrde, muBten auch die Uberschriften 
manuell gesetzt werden. SchlieBlich waren auch die 
notwendigen Fahnenkorrekturen und der Umbruch vom 
Setzer durchzufiihren. 

Alle diese manuellen Eingriffe sind jetzt nicht 
mehr notwendig, den n es sind bei Verwcndung der 
Protypen alle Zeichen zu erfassen und deshalb per 
Programm bei der Ausgabe direkt in den einzelnen 
Rahmen der LINOTRON 505 anzusteuern. Die Uber- 
schriften werden vom Programm in den Text eingefiigt. 
Auch die Korrektur an den abgesetzten Fahnen und 
der zweispaltige Umbruch einschlieBlich des Setzens 
einer lebenden Kolumne und des Auftreibens der 
Spalten werden vom Computer ausgefiihrt. Das ver- 
ringert nicht nur die reine Absetzzeit ganz erheb- 
lich, sondern hilft auch, Fehler zu vermeiden, die 
vor allem beim manuellen Einfiigen der Uberschriften 
immer wieder vorkamen. 

Zusammenf assend laBt sich sagen: Die Verwendung der Pro- 
typen und die programmtechnische Beriicksichtigung der 
maschinellen Gegebenheiten der Photosetzmaschine erforder- 
ten einen nicht unerheblichen zusatzlichen Programmier- 
aufwand. Doch wird die Frage des Aufwandes bedeutungslos , 
wenn man an die beschriebenen Vorteile denkt. Ich darf 
daran erinnern, daB es bei dieser Form des Computersatzes 
ein Zeichenvorratsproblem nicht mehr gibt, daB sich die 
Vorteile in der Setzgeschwindigkeit und den umf angreicheren 
typographischen Gestaltungsmoglichkeiten zeigen und daB 
schlieBlich manuelle Eingriffe nicht mehr notwendig sind, 
da Silbentrennung, Fahnenkorrektur und Umbruch per Programm 
durchgefiihrt werden. 
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COMPUTER TYPESETTING WITH AN 
EXTENDED FONT OF CHARACTERS 

Horst Zuchel 

Translated by the author from the German original 

In 1964, the German Library in Frankfurt started to check the possibilities of using 
an electronic data processing machine for production of the German Bibliography, The 
constantly increasing number of publications and the staff needs required appropriate 
steps in order to shorten the period of production of the Bibliography, At that time, 
the period of production was about 16 until 18 months. The system developed by the 
ZMD, the Center for Mechanization and Documentation, allowed to process data by com- 
puter from the data recording until the printing phase. As output, the computer 
delivered a six-channel TTS-punched paper tape for steering a line casting machine. 

So, the German National Bibliography was the first national bibliography in the 
world produced with the aid of a computer. It was published on January 6, 1966, 

Therewith, the fundamental problem was solved in a typographically appropriate 
manner to produce automatically bibliographies, registers, catalogs, etc. But this 
system wasn’t quite satisfactory. First of all, it used machines that had to be re- 
garded as weak points. That would be found at the punched paper tape typewriter 
used for data recording. Usually these machines have 44 keys in upper and lower 
case which allows for the representation of 88 characters only. Such a limited font 
of characters seems to make it impossible to produce a national bibliography which 
could meet the approval of all users. The problem of the limited font of characters 
was also found at the stage of real data processing, because the computer used (an 
IBM 1460) has the BCD code. Only 64 characters can be represented in this code. 

The linecasting machine, LINOQUICK, used in the output stage, also did not satisfy 
the required demands. 

The speed of typesetting is too low in relation to the speed of the computer. 

Also the possibilities of hyphenation, page make-up, galley-proof correction and of 
an appropriate typographical layout are limited. 

Therefore, the ZMD had developed, in cooperation with the Deutsche Bibliothek, 
a more efficient system of computer typesetting. 

The improvements range from a more analytical data recording and an extension 
and refinement of the programming system up to the employment of a photo composing 
machine, I want to present this more efficient system to you in the following, 

First of all, I want to point out the organizational running off and then I want 
to show you how we have eliminated the above described weak points. 



Computer Typeaetting 

The data ate recorded on 8-channel punched paper tape# These data are converted 
on a magnetic tape, in order to increase the input speed for the computer. Before we 
can start the real processing, we have to check and to correct the data. Then it is 
necessary to prepare the data for the following program. That means character trans- 
formation, permutations (for example, extract of authors, descriptors for producing 
registers), preparing for sort and the following sort or merge of data. 

In the output stage the data are prepared for the typesetting machine. That 
means line preparing and hyphenation, changing the computer code into the code of 
the typesetting machine, and to insert the control signs for the photo composition 
machine. Then the data are set up in type on the galley proof. Each line has a 
current number for proof corrections. If the data are flawless, we can do the page 
make-up and finally, the typesetting on film. This is the end of the processing 
chain. 

The photo composition machine, LINOTRON 505, has a font of about 1000 characters. 
That satisfies all required demands. 

Now, it was necessary to solve the problem of the limited font of characters at 
the data recording and processing. The mentioned paper tape typewriter has a font of 
88 characters, I am sure that there are recording machines which can represent more 
than 88 characters, but I am also sure that there is no machine which can represent 
all the characters you want. On the other hand, an extended font of characters at 
the data recording is only effective if you can process these characters. That is 
impossible with a font of 64 characters of the IBM 1460. Also the use of a type- 
machine wouldn*t solve this prcblem, because the limitation of this machine is about 
256 characters. Of course, this font of characters would be sufficient for product- 
ion of the German Bibliography, But as the problem of the limited font of characters 
is a general problem in the field of non-numerical data processing, it was necessary 
to find a way to extend the available font of characters as comprehensively as 
possible. 

The ZMD established systematic tables where special characters are represented as 
so-called "Protypes' 1 . Protypes stand for the original characters which cannot be 
otherwise represented during certain processing pahse, e,g, especially during data 
recording. In principle, the method of replacement is well known in the printing 
business. But in contrast to the old fashioned replacement methods, ZMD arranged 
comprehensive collections of special characters and organized them systematically in 
tables. The representation chosen takes into consideration a simple handling for 
data recording and the further computer processing. 

Seen in detail protypes consist of three characters: the basic letter or figure 
and two following figures. Protypes have to be marked in the text y a preceding 
control character, e.g. § , There is no necessity to mark the end of the protype 
because of its fixed length. 

There are two kinds of protypes: 1) protypes for representation of special letters. 
For example, the apecial letter, b, is represented as §a 22, with the paragraph sign 
as the control character for the protype, a as the basic letter, and 22 as the follo- 
wing figures, 2) Protypes for representation of special characters, e.g., the special 
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character %, is represented as § 364 with the paragraph sign as the control character, 
3 as the basic figure, and 64 as the following figures • 

The advantages of this kind of character encoding ares 

First; you need for representation of protypes only the 26 letters and the 10 fig- 
ures. That means protypes can be types, without exception, with any normal punched 
tape typewriter. 

Second, the system is open ended and further diacritics may be added and any cor- 
responding letter with its two-digit number has already its fixed place. 

Third, protypes are always built in the same way no matter what they are used for. 

Fourth, all protypes can be used without exception as indices. Only one addi- 
tional control character must be inserted between the first control character and the 
protype. The inserted control character indicates whether the following protypes is 
in superscript or in subscript position. The additional control character is repre- 
sented as overlining or underlining with a not movable type-bar of the typewriter. 

The protype itself remains unchanged. It is important for a simple handling of 
recording and for the proof reading. 

The disadvantages of this system have to be seen in a quantitative extension of 
the recorded data, because it is necessary to type four characters for the represen- 
tation of one character, but certainly, you would put up with this disadvantage if 
you have to record all data without loss of information. 

The improvements of the program were, first of all, the output stage. It was 
the aim to eliminate the manual handling by using a linecasting machine and to make 
the best of the possibilities of the photo composition machine. 

The advantages of this system in using a photo composition machine are; 

1. Higher speed of setting. It is very difficult to give a generally valid 
judgement on the real speed of a typesetting machine. The speed depends on the num- 
ber of covered magazines (by linecasting) or frames (by photo composition), the size 
of font and the length of the line. For example, if only one magazine or frame is 
covered, the size of the font may be 8 points and the length of a line doesn’t exceed 
50 characters, we have found that the line casting machine, LINOQUICK, sets up in 
type about 3 to 5 characters and the photo composition machine about 70 to 80 charac- 
ters per second. So we can say the photo composition machine is 20 times faster than 
the linecasting machine. 

2. More extensive typographical layout; the line casting machine LINOQUICK sets 
up in type in one size of font only which brings about a rather monotonous layout. 

The LIN0TR0N 505 has 15 different sizes of font. So we can in the program have the 
heading, the text, the spaceline, the footnote, etc. set up in the type all in dif- 
ferent sizes of font. Furthermore, we have improved the typographical layout by 
automatic hyphenation. 

3. Elimination of manual handling; manual handling on the linecasting machine 
is necessary, if there is a character to set up in type which could not be recorded. 
It is also necessary for setting up headings (titles), for proof-correction and for 
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pagination. All this manual handling is superfluous now in the new system. 

Finally, let me say: All desired characters we can record and directly set up in 
type on the photo composition machine. The headings are inserted into the text as 
well as the galley-proof correction and the page make-up are put through by program. 
That reduces not only the setting up time but also it means the avoidance of typical 
errors made in hot-metal typesetting. 
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MECHANIZATION OP PUBLICATIONS 



M.D. Martin, P. Simmons, M.J. Vernon 
(iNSPEC, The Institution of Electrical Engineers, London) 



SYNOPSIS 

INSPEC* s integrated computer-based production system 
for its publications is described in three parts: the 
background to the developments undertaken during the 
past four years; an overview of the system as it now 
operates; and a survey of some of the problem areas 
which have been encountered in day-to-day running. 

On January 2nd 1969 The Institution of Electrical Engineers (lEE) 
published the first issue of Physics Abstracts to have been 
produced by computer-controlled phototypesetting techniques. Since 
that date, the whole range of INSPEC publications - abstracts 
journals, indexes, and titles bulletins - has been compiled and 
produced as part of an integrated computer-based system. 

The present paper describes INSPEC* s experience with this system, 
concentrating particularly on those aspects which concern the 
production of printed publications, though it must be remembered 
that this is only one of a number of facets of the total system. 

The development of the INSPEC ^production system 

Since 1898, the IEE has published Science Abstracts, which today 
comprises three abstracts journals covering physics, electrical and 
electronic engineering, and computers and control. Each section has 
an associated titles publication under the generic name Current 
Papers . The abstracts are of papers selected from the periodical 
literature and from conference proceedings, books, reports, patents 
and dissertations. The present volume of publication amounts to 
some 1^0,000 abstracts per year. 

The abstracts are published within a subject classification having 
some 600 chapters and sub-chapters, and each is indexed by subject 
under some 3000 controlled headings, with free- language entries, to 
enable a six-monthly index to bo prepared. Each issue of the abstract 
journal carries an author index, which is also cumulated each six 
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months; and a number of other indexes are provided. The titles 
publications art? classified to the same level as the abstracts 
journals, but have neither abstracts nor indexes, and are designed to 
servo as a currcMit-awaroncss tool for individual scientists and 
engineers • 

The decision to mechanize the production of all these publications 
was taken early in 1967 , following a series of research studies. 

This decision would probably not have been taken - certainly not at 
such an early date - if it had been viewed solely as the 
mechanization of a conventional publishing operation, for it cannot 
be claimed to have reduced the printing and publishing costs, taken 
in isolation. Nor, on the other hand, has it significantly 
increased them; and it must be borne in mind that the break-even 
point between conventional print production and computer-controlled 
phototypesetting, in the context in which INSPEC operates, is almost 
certainly getting closer year by year. 

The importance of the decision to mechanize the INSPEC publications 
was that it represented a commitment to develop a total computer- 
based information service, of which printed journals would be only 
one of a number of products. Other products, such as magnetic tape 
services and SDI, are outside the scope of this paper, but they have 
been considered as an integrated part of the system from the very 
beginning. Their implementation has been phased over the past two 
years, and is continuing at the present time. 

The heart of the INSPEC production system is a single data base into 
which are entered bibliographic records covering the whole subject 
field of the services which INSPEC provides. Each record contains an 
abstract, bibliographic description, classification and subject 
indexing. From this single data base the required subsets are 
selected and processed to provide a variety of different outputs. 

System overview 

In outlining the production system ns it now operates, it is 
convenient to describe the procedures used for journal literature, 
which forms over 70 $ of the data base. For other types of literature, 
the system is similar in principle but differs in detail. 

At acquisition, the receipt of the journal issue is recorded, and a 
control sheet is attached to the contents page. The issue Is marked 
out to the particular group of information scientists which 
regularly handles this Journal, and an identification number is 
entered on both the control sheet and the contents page. The issue 
then passes to the group leader responsible, who marks up his 
selection on the contents page, by entering one or more of the 
letters, A, B or C against each title, to indicate whether the item 
is required for Physics Abstracts . Electrical A Electronic Abstracts 
or Computer A Control Abstracts . 
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Almost all English language material, or material carrying English 
abstracts, is dealt with in-house. Some foroign-languago journals 
are handled completely by a single outside abstractor; others are 
dealt with by sending individual papers to abstractors. 

For English-language material, the information scientists classify and 
index the paper (entering tho classification numbers and index 
entries on a work sheet). Thoy may compile an original abstract or 
edit, abridge or augment the author abstract, ns required. 

After tho editing stage is complete, the issue is returned to tho 
acquisitions section to bo marked off against their records, and tho 
copy prepared for keyboarding. Wherever possible, tho original title 
page of the paper, torn from the journal, is used as copy for the 
keyboard operators, who themselves pick out tho various data elements 
such as title, authors 1 names, abstract etc.; so that transcription is 
reduced to a minimum. The information scientists' worksheets are 
attached to the corresponding title pages, and the batch of copy is 
headed by tho journal control sheet. The contents page is filed as a 
historical record. 

At the present time, all input to tho data base is keyboarded on to 
paper tape, using specially modified Ploxowri tors • No capital letters 
are usedt instead, tho ribbon colour shift is used and capitals appear 
as lower-case letters in rod. Tho upper-case positions on the typo 
bars are used for common Greek and mathematical symbols, while other 
special symbols are entered by means of an escape code. 

The paper tapes are run on to tho computer and added to a "Current 
Pile 11 , of material awaiting publication. At the same time, 
validation checks are applied by computer program, and a proof listing 
with error reports is produced. Since tho line printer has a 
character sot of only 6*1, it is necessary to represent other characters 
in a special way. In general, this is done by putting tho name of tho 
character (or an abbreviation) between a pair of delimiters. Thus 
Greek y is printed as jfbAMHAjjf • 

Tho computer-produced proof is road by a specially- trained staff, and 
corrections are marked on tho printout. Papers which require no 
corrections are immediately released for production use; i.o. there is 
no need to wait for a complete primary journal issue to have boon 
corrected before releasing individual papers. 

Corrections are keyboarded on to paper tape, and merged with tho 
Current Pile as part of a single daily update, at the same timo as now 
items nrc added. 

Records are released into tho data base by directives on paper tapo 
which are also entered during tho sumo update run. Tho f coords 
remain physically on tho Current Pile until a Journal production run, 
at which point they are removed from tho file and processed for 
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publication • 

Tho production of an issuo of ono of INSPEC's Scionco Abstracts 
Journals starts at tho point when tho matorial for that issuo is 
alroady in oxistonco on tho Current Filo, having boon input, corroctod 
and roloasod sinco tho last issuo of that Journal. Tho procossos 
doscribcd apply equally to any of tho threo abstract Journalsj tho 
production procoduro for tho titles bulletins is ossontially similar, 
though somewhat simpler. 

Tho first computer program in tho production suite solocts matorial 
from tho Current Filo which is required for tho particular issuo, 
rojocting incomplete or uncorroctod items* Tho Current Filo is 
structured in batches, each of which roprosonts a complotc journal 
issue or other sourco publication, so that such information ns "Journal 
titlo" and "dnto of publication” nood bo stored only onco at tho hond 
of each batch. The next stago of processing involvos sorting tho 
Journal filo into classification order. At this point tho filo losos 
its former batched structure, so that it is necessary during tho 
selection run to ensure that tho information which previously 
appeared only in the batch bonder is now associated with each 
selected record individually. 

Tile selcctod filo is sorted into tho classification soqucnco indicated 
by a chapter code which is assigned to onch ontry by tho oditors. 
Additional chnptor codes are assigned for cronting cross- roforonco 
entries* Within the sequence indicated by classification codos, 
item* from tho same source publication nro grouped together in pngo 
number order. 

Tho next program in the sulto performs tho functions of numbering onch 
item on tho sorted filo and of cronting a filo of cross-re foroncos and 
an indox file. Tho numborod journal file contains all tho main 
ontrico for the abstract Journal in tho corroct classification 
sequonco. Knch entry on this filo has all tho information which was 
included in the original Current File rocord, and it is this filo 
which is merged with those from previous numborod issues to produce 
tho nrchivo filo for rotrospec tivo searching. Each entry in tho filo 
of croas-rofercncos includos a chapter codo, tho title of tho 
document, and the abstract number to which it rofors* Tho indox file 
includes author index entries and entrios for spocial conforonco, 
report, pntont, book and bibliography indoxos which appear in onch 
issuo of tho abstracts Journal* Since subjoct indoxos arc producod 
only on a six-monthly basis, thoir production is hnndlod soparatoly* 
Tho filo of cross-roforoncos is sortod into classification soquonco 
in tho samo way ns tho main Journal file. It is tit on morgod with tho 
numborod Journal filo and with u pormnnont filo of classification 
headings, to create the comploto toxt filo rondy for composition* 
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Up to this point the text files contain few of the codes needed to 
format the records for printing. Apart from upper and lower case 
shifts and occasional words or symbols in the text of the abstract 
which are required in italic or bold, the entries contain no stylo or 
formatting codes which, wore they present, would make searching or 
other uses of the data more difficult or less efficient. 

The formatting of the records for printing is achieved by computer 
program. It is done in two stages, mainly because of restrictions in 
the size of the computer store. The first stage combines all the data 
elements needed in each printed entry in the required sequence 
according to sots of parameters in the composition program. 

The typesetting software is highly generalized, and is driven by two 
levels of parameters. The first level identifies the boundary 
conditions required between records of different typos; i.o. the 
spacing and stylo changes which occur, for example, when a main 
heading is followed immediately by a sub-heading, or when an abstract 
is followed by a sub-heading. The second level identifies the data 
fields which are to bo selected from each typo of record, the sequence 
in which they are to bo printed, and the style, spacing and punctuation 
which is required for each field. Fixed literal information can also 
bo entered into the parameter list, and the parameter format allows 
cortain typos of conditional statement to bo used to vary the output 
in accordance with the presence or absence of selected fields. 

The structured records contained within the input file are thus 
converted into a continuous character stream including typographic 
function codes, and the program Justifies the text, hyphenating 
automatically whore necessary, and writes each lino forward to 
magnetic tape. This tape is input to the final program in the suite 
which breaks the text up into columns and adjusts the inter-entry 
spneing so that each column has the same length. The column make-up 
program also introduces running headings and page numbers, and 
performs the function of translating tho file into the code structure 
for driving the photo-composing machine. At present a Photon 713 
Toxtmastcr is being used to filmsot tho INSPEC publication. 

As far ns possible, INSPEC production software has boon written in a 
gonornl-purposo way in order to minimise tho programming effort 
required to process INSPEC files, and so that tho software can 
onsily bo adapted for file maintenance and typesetting work for other 
applications. 
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Probloma of a moehanizod publications system 

Tho re can bo no doubt that nil I lao major problems which liavo boon 
oncountorod in over two-and-a-hn I V yearn* running have fallen within 
what may broadly bo doacribod as the "input" end of the system, i*o. 
bo tween tho initial keyboarding of new items nnd thoir release for 
publ lea tion. 

Once a clean record has been roloascd into the data bnao v its 
subsequent processing is straightforward and, with proven programs, lias 
turned out to be almost entirely error-free, as might have boon 
anticipated. The only remaining source of difficulty has boon in tho 
compilation of periodic subject indexes, for two reasons: ono, that it 
lias not as yet been possible to implement a system for tho comploto 
validation of sub Joe t index headings at the input atago; and second, 
that indexers do still find It very desirable to bo able to review 
their choice of tonus in the light of tho appearance of tho final 
sorted index | so that it has been necossury to permit corrections to bo 
entered even at this lute stage. This is tho only situation in which 
tho system departs from the guiding principle that a cloan record 
should result from the initial input procedure, and bo processed 
thereafter without outside intervention. 

Tho final output from the system to tho photo composing machine has 
similarly been quite trouble-free. Some minor problems of tape 
compatibility and certain anomalies in tho control codes for the 
filmsottor were successfully overcome at tho beginning of tho project. 
Some slight misalignment occurs from time to time as a result of 
electro-mechanical faults, but this is normally identified by tho 
printer anil corrected by a re-run of tho relevant part of tho Job* 

It is worth pointing out that, while it is likely that tho system 
will bo transferred to a faster filmsottor in tho near future, tho 
comparatively slow speed of tho Lumitypo 713 (some 70,000 characters 
per hour) has never seemed to bo a matter of real concern. Tho move 
to a newer machine will bo Justified on grounds of cost and 
flexibility, not on grounds of speed. Tho improvement in currency to 
bo gained by reducing tho filmsotting time of an abstracts journal 
issue from 1-2 days to 1-2 hours is not really significant within 
tho overall timescale of production. 

Returning to the input area in detail, tho major factors are probably 
tho difficulty of dealing with a very largo character sot within tho 
limitations of conventional computer input/output devices, and tho 
problems of automatically validating much of the typo of data which 
tho system handles* 

Tho total INSPEC character sot amounts to some 700 characters. If the 
distinctions between roman, bold, italic, normal, subscript and 
superscript are eliminated, tho sot reduces to Just under 200 discrete 
symbols. It has provod possible, with quito surprising success, to 
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handle this character sot with modified Flexowriters and a wholly 
standard 6 ^ 4 -charac tor line-printer as the proofing medium; but 
nobody who has boon associated with the work would regard this as a 
satisfactory long-term solution* 

Current developments at INSPEC are loading towards the use of on-line 
visual displays for correction and release, perhaps associated with 
an extended character sot line-printer for proofing; with some parts 
of the original input on-line, and others entered by off-line 
keyboarding on to magnetic tape. At the present time wo are still 
sceptical of the economics and the effectiveness (particularly in 
terms of operator acceptability and efficiency) of using visual 
displays for proofreading. 

It is easy to talk of using on-line displays; until recently there 
has boon a real problem in tidying to go beyond theoretical studios. 

In general, computer and peripheral equipment manufacturers are 
oriented towards a commercial data processing market, and towards 
large-scale batch production. There have boon many display terminals 
on the UK markets few, if any, have been suitable for handling 
large volumes of scientific text. This position is changing slowly; 
96-character displays arc replacing 6^-cliarac ter displays as the 
norm, and hardware-generated sot of 192 characters now seems to bo 
feasible at a reasonable price. 

The problem of validating bibliographic records and reducing error 
rates to an acceptable level is a difficult one. The fundamental 
conclusion to bo drawn from INSPECT experience that if an item can 
bo validated in any way by computer program, it is wortli doing. Most 
numeric or alphanumeric codes used ill the production system carry 
check characters. Classification codes are specifically chocked by 
table look-up. Some fields, sucli as the text of an abstract, are 
regarded as wholly intractable. Others, sucli as author names, which 
appear on the face of it to bo almost impossible to validate, can 
in fact bo subjected to certain simple chocks concerned mainly with 
format and presentation. Caro has to bo taken in many instances to 
ensure that the punctuation in the input record is consistent with 
the punctuation which is applied by computer program at the 
typesetting stage. Additionally, whore much of the data consists of 
free text, it is necessary to take careful account of things which 
a keyboard operator may do which would not appoar to be incorrect at 
keyboarding time, but which would cause anomalies in the output; 
eg. the introduction of superfluous 1 space * characters at the end of 
a line. 

There is much scope for the use of authority lists in an input system 
of this kind; those are used in the present system for journal titles, 
classification codes and chapter headings; and at the index 
production stage for subject index headings. In the near future it is 
hoped to implement a direct access system for using the autority 
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file of subject index headings at input; and to study the feasibility 
of a similar approach to dealing with frequently-cited organisation 
names and addresses* 

Despite these problems, the present system has proved itself in 
operation over the past two-and-a-half years, and has been one of the 
first to demonstrate the feasibility of integrating the total 
production of a large-scale abstracting service into a single 
computer-based operation* After some teething problems during the 
first year, the system 1ms settled down satisfactorily and currently 
achieves a throughput time which is notably better than the previous 
conventional operation, both as regards the abstracts journal issues 
(now averaging less than three months from the receipt of a primary 
document to the despatch of the printed abstracts journal) and the 
periodic indexes, which are normally with the printer within four 
weeks of the production of the final issue for the period. 



COMPUTER AIDED TYPESETTING FOR THE ENCYCLOPEDIA JUDAICA 



Meir Doron 

Keter Publishing House , Jerusalem 



SYNOPSIS 

The 16-volume Encyclopedia Judsica prepared by the 
Keter Publishing House is being produced by computer- 
aided typesetting. The system, described in this 
paper, saves tine, ensures easy updating, and avoids 
the need for skilled typesetters. The Index volume , 
which will Include 200 p 000 entries, will be ready two 
weeks after the final volume is completed. 

The Keter Publishing House (s subsidiary of the Israel Program for Scientific Transla- 
tions) has its own typesetting, printing and binding facilities. In 1968 these type- 
setting facilities consisted of Monophoto (1st generation film typesetting) equipment. 
When the firm decided to publish the Encyclopedia Judaica (16 volumes of 800 pages 
each) it was realized that additional equipment would be required, especially when 
considering the very tight schedule which called for publication before the end of 
1971. The management of the firm decided to go into computer-aided typesetting for 
five major reasons. 

1. Possibility of easily updating future editions of the Encyclopedia Judaica. 

2. The esse with which material can be sorted by computer would permit the 
setting of articles In random order as they come out of the editorial depart- 
ment without waiting for a complete volume. This would save months of time, 

3. Making corrections by updating computer tapes rather than stripping in cor- 
rections by hand would solve the acute problem of lack of skilled manpower 
for hand corrections. 

4. The possibility of automating or partially automating the preparation of 
the index volume would save months of time. 

5. Since there were no experienced typesetters available, the problem of train- 
ing a large staff in a short time would be less complicated for a computer- 
aided system. 

The following hardware was used 

1. Input was in the form of paper tape punching equipment with hard copy. 

Originally five Facit 6200 keyboards were purchased but these proved to be 
too unreliable. They were replaced by ten Friden Flexowritera. 
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2. The IBM 360-50 of the Mekoroth company was used to process the material 
on a time rental basis* 

3* A Photon 713-10 Textmaster dviven by a Hewlett Packard 2020 magnetic tape 
reader as well as its own paper tape reader, was purchased by the firm* 

4* For making final corrections directly by hand, Frlden LCC-VF keyboards 
were used* 



The software was prepared by "Natam", who planned the whole system in close cooperation 
with Mr* Moshe Shalvi and the staff of Keter* The tight scheduling required some hectic 
work by the programmers to meet the deadlines* 

Input 

The prime requirement was to make the system as uncomplicated as possible for the in- 
experienced typists who had no knowledge of typesetting. 

Since the hard copy was to be proof-read, it was typed on continuous stationery, each 
page having room for a header and line numbers from 1 to 30* The basic unit in the 
Encyclopedia was the article, which was subdivided into pages and lines* Control codes 
were identified by a square on the hard copy which was a "dead" key so that letters 
could be typed inside it* (On the perforated tape it appeared as a unique code) so that 
the control codes for italic look like thisjT], bold [b; , roman [3 • sraall caps [c] , 

8 pt, 9 pt, etc. [& Ujl f end of paragraph [ti] and many more. If the typist felt 
that she had made a mistake she could cancel one or several letters by hitting the back 
spare key (a unique code), slashing out the unwanted letters, and then typing the letters 
correctly. The proof-reader then simply ignored the slashed letters* The typist was 
not allowed to break words nor wap she required to keep to any line length* Accented 
letters were set in two codes but on the hard copy the letter appeared in the regular 
way. For example, was set thus?"''" (a dead key) and "a". To the proof reader it 
appeared then actually as '*£"« 

Ligatures were ignored by the typist but were converted by the computer* 

The resulting paper tapes were read into the IBM 360-50 of Mekoroth and processed through 
the input stage which formatted the job, checked for wrong codes (vhich were drawn to our 
attention by a print-out of error messages), and held on magnetic tape for future up- 
dating* Meanwhile the proof-readers read and marked the hard copy. 

Corrections to the hard copy 

These were executed by the aame typists* They accessed the proper place for the correc- 
tions by typing (along with the proper control codes) the article number, and page and 
line number on the hard copy* They had the option of correcting the whole line or cor- 
recting "unique phrases" which contained the word before and after the word to be cor- 
rected. The correction tapes were then sent to the computer where they went through 
the input program and were then used to update the original tape in the update program* 
The updated tape was then put through the composition program which cut the text into 
lines and created new line numbers. At this stage, because of the many editorial 
changes that were expected, the lines were still unjustified and no word breaks were 
made* The next stage was the conversion into Photon codes by the computer. 

The reaulting magnetic type *:as converted into type by the Photon 713-10 at a speed of 
about 25 characters a second. A special type matrix drum was designed for the Encyclo- 
pedia Judalca project* Since there was room for 768 different characters, we were abla 
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to include all the required type fonts, accents, and special signs. We used Times' 

New Roman, with italic, bold and small caps, Univers Bold and Medium, Greek, superior 
and inferior numbers and many accented letters. Almost all accents were in the form 
of floating accents, so that a very few places on the matrix drum could produce a 
much larger number of accented letters. Our list of matrices and their different com- 
binations actually come tc r.ore than 18000 characters. 

The resulting bromide print was copied onto sun-print paper and passed onto the proof- 
reading department and frora there to the editorial department for an editorial check 
and possible last-minute updating. 

Corrections to the unjustified first proof 

They were prepared exactly as were the hard-copy proof corrections except that this 
time the typist used the new line numbers as generated by the computer. In addition, 
it was necessary at this stage to arrange the articles in the proper alphabetical order 
Since the rules for alphabetizing are rather complicated it was decided, rather than 
writing a program, to punch IBM cards with Che names of the articles and arrange them 
in proper order by hand. The computer then sorted out the articles according to the 
order of the IBM cards. 

Further, the lines had to be justified now (the Photon actually did the justifying, 
but they had to be cut into proper length by the computer) , and the IBM hyphenation 
sub-program hyphenated the words that had to be broken at the end of lines. This 
program uses a logic system as well as an exception dictionary. This program worked 
very well for English, but did not always break foreign words or names correctly. If 
the break was not correct, this particular word was then added to the exception dic- 
tionary and was therefore broken correctly from then on. 

This new tape was again run off on the Photon with new line numbers down the sides 
(generated by the computer). The resulting justified galley proof was again passed 
on to the proof reading department for checking. 

Corrections to the second prood (justified) 

The whole procedure was repeated as above except that the output of the Photon this 
time was on film rather than on bromide paper. 

Corrections to the film and make-up 

Since it was expected that the film would be almost completely free of errors, any 
minor corrections still to be made would be set on justifying keyboards (Friden LCC). 
The resulting paper tape was put through the Photon and the correction lines stripped 
in by hand. 

At the beginning, we planned for automatic page make-up by the computer, which will 
indeed be used on the index volume (see below). The inclusion of thousands of illus- 
trations would make a fully automatic system impractical especially from an esthetic 
point of view. We found that manual make-up would be faster and cheaper. 

Occasional changes were made in the procedure outlined above: the hard copy correction 
was almost completely abandoned part way through the job. It was found not always to 
be the same as the input tape due generally to some procedural error on the typists' 
part. Besides, it was found that proofreading was much more accurate when reading the 
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actual type rather than interpreting control codes. Occasionally, due to heavy edi- 
torial changes, we introduced another round of corrections for certain articles. At 
other times we skipped certain rounds especially for articles arriving late. 

At the writing of this paper all the material for the encyclopedia has been input, and 
the first ten volumes (out of 16) have actually been printed. The rest of the ency- 
clopedia is scheduled to be completely bound before the end of the year. 

The index volume 

One complete volume of the encyclopedia is to be an index 9 containing over 200 9 000 
items. An index of this size 9 prepared by conventional means would be finished many 
months after the completion of the rest of the encyclopedia. By using an automated 
system, the index volume will be ready only two weeks after the last volume is 
completed. 

Dr. Posner, who heads the index project, together with the people at Natan, devised 
the following system: 

1. The indexers read the manuscript and mark each word that is to appear in 
the index with codes containing serial numbers [I] QJ,[l] |Tj , [T] CD’ 
etc.), the entry-words for each article being numbered from IT 

2. At the same time the indexers prepare a file card, marking down the word 
or words to be indexed, the article number in which it appears and the 
serial number in that article. 

3. The file cards are edited into main items, sub items, etc. and filed alpha- 
betically in boxes. As new items come in, re-editing is often required. 
Several identical items may appear on one card each one with its article 
and serial numbers. 

4. The input typists type each card in its proper alphabetical order, giving 
the item, article numbers and serial numbers. This operation was started 
a short time before all the articles were actually set, any additional 
entries being included as corrections (see below). 

5. Proofs are read on hard copy and then on a Photon run of the computer input. 
All the items in this stage still appear in their unconverted form exactly 
as they were input. Corrections are punched and the material updated. 

6. Since make-up of the encyclopedia itself was not automated, it is now 
necessary to ''inform 11 the computer which article is contained in each col- 
umn of the encyclopedia, (for example: column 532 contains article number 
535403 from line 23 to line 78). This operation proceeds immediately upon 
make-up and will be finished in time for the next stage. 

7. The computer then converts the serial and article numbers into volume and 
column numbers (the columns are numbered rather than the pages). This 
means then that the item is identified as in any index, the reader being 
directed to the proper page. 

8. This is run off on the Photon, corrected in the usual way, and made automa- 
tically by a computer program, which arranges the material in four columns 
with running heads and page numbers automatically inserted. 
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MECHANIZATION AND AUTOMATION IN PUBLISHING 



Lowell H. Hattery 
Professor of Management 
and Public Administration 
The American University 
Washington, D. C. 



SYNOPSIS 

To achieve maximum potential of technological innovation in 
publishing, a higher order of systems concept and man-macUine 
relationship is required, including on-line, inter-active edit- 
ing. Substantial investment must be made in systems research 
and software development. Information specialists should be 
leaders in conceptualization, research and action toward more 
effective mechanization and automation in publishing. 



Mechanization and automation of the printing-publishing process 
has several significant points of impact. 

Technological change in printing and publishing is of’ special in- 
terest to the printing industry for it alters methods of production, changes 
economics and affects organization and managerial processes. It is signifi- 
cant to publishers and to users because technological changes may affect the 
nature of the medium, cost per unit and time of delivery. Technology is of 
interest to the author for the form of his manuscript may be constrained, 
and because he too is interested in time, cost and output form. 

Finally, technological change in printing and publishing is of in- 
terest to the information specialist since it affects the input, the store 
and the output of information for an information system. 

Information Systems Slow to Automate 

A somewhat inexplicable lag in adapting new technology has charac- 
terized information services. Despite the fact that electronic information 
processing machines have been available for 25 years, information special- 
ists have been very slow to mature a concept of achieving optimal integration 
of new technology into operational information systems. 
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One of the reasons for this deficiency is the historical separation 
of the functions of information development, printing, storing, retrieving, 
reformatting, distributing, and revising the store. Even if one institu- 
tion has control over these functions, occupational specialization has 
effectively maintained separation in thought and action. 

Yet the sequence of functions can be viewed as a single process; 
they must be so viewed to utilize new technology optimally. 

Another barrier to achievement of automated systems in information 
has been the absence of standards and standardization. The myriad of type 
faces, different forms of citation, various indexing and classification 
systems, spelling variations, etc., etc., all contribute to confusions in 
the system which complicate and increase the cost of computerization and 
automation. 

Electronic Composition 

One of the most intriguing innovations in the information process 
is electronic composition, also referred to as computerized typesetting. 

Although application of the electronic computer to typesetting 
seems an obvious and straightforward process, it is just becoming a 
significant production factor. A dozen years of fumbling experimentation 
is now clarifying the nature and the economics of the process. Modifica- 
tion and refinements of hardware, software and production methods have been 
developed encouragingly in the last three years. And a better understanding 
of input requirements and of output potential gives a basis for optimism 
for the future. 

Of greatest potential impact for electronic composition is a better 
design linkage with information stores. 

A second important contributor to the growth outlook is the avail- 
ability and utilization of smaller, lower- cost machines, somewhat comparable 
to the development and popularity of minicomputers. The editor of Repro - 
ductions Review said recently: "In the past decade the art of phototype- 

setting has evolved from the prohibitively expensive province of the giant 
publishers to a versatile tool available to both high and low volume user 
at either end of the price scale."-*- 

In its 1970 survey of computer typesetting, Composition Information 
Services, Inc. reported that in the United States 1,400 computers are in use 
or on order for typesetting. CIS also reported in surveys of available 
equipment that the number of "low-priced typesetters able to accept unjusti- 
fied type" has more than doubled in less than two years. 2 

Riley, Itf., "Phototypesetting Todays Where Is It", Reproductions Review , 
April 1971, p. 24. 

2 CIS Newsletter, February 1, 1971 
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Printing industry consultant P. L. Andersson discusses the growth 
of computer typesetting in the following statement: 

The reasons for the popularity of phototypesetters are 
simple and direct. First, much more printing is being done 
today than ever before, and the higher speeds possible with 
electronically driven phototypesetters are needed to keep up 
with the market. Second, more and more printers are convert- 
ing to offset lithographic printing, rather than letter press. 

The lithographic process uses a photographically prepared 
plate, and thus is more compativle with photographic type- 
setting than the letter press process which utilizes the out- 
put of type and line casting machines directly. Third, and 
more to the point, computers have gained a wide acceptance in 
the printing industry as machines which will hyphenate, justify, 
format, edit, and merge corrections and alterations . ^ 

The most critical unsolved problem in electronic composition is the 
development of adequate software. 

Efficient software is a most formidable task if the machine system 
is to perform in the patterns similar to manual composition. Decisions 
about hyphenation, etc. require a very complex set of rules and a large 
store of exceptions: page make-up is a complex task if the decisions are 

to be entirely machine- controlled. 

Unfortunately, software research and development has not been given 
adequate support. 

The requirements for a comprehensive software project are presented 
by John W. Seybold in the proceedings of a 1970 conference on computer 
composition. 4 Mr. Seybold noted the need for "general-purpose computer type- 
setting software which can be used on standard medium-scale computer equip- 
ment." He adds: "I do not believe that any present software is sufficient, 

or adequate, or general enough, or straightforward enough, or versatile 
enough." 

Development of satisfactory general programs for computer composition 
will require a considerable investment of time and money. Standardization 
of many variations among composing conventions is highly desirable. Inter- 
national cooperation in standardization of conventions especially and in 
software development could be helpful to all. 



3 Andersson, P. L., "Phototypesetting -- A Quiet Revolution," Datamation , 
December 1, 1970, p. 22. 

^ Landau, R. M., editor. Proceedings of the ASIS Workshop on Computer 
Composition . Washington, D. C.: American Society for information 

Science, 1971, pp. 87-98. 
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Composition Input 

Efficient input for automatic composition is a principal concern of 
industry specialists. 

High time and cost factors in printing and publishing reflect in 
large measure preparation of a manuscript and the introduction of that 
manuscript into composition of various possible forms. Only recently has 
hard attention been directed to designing input systems which are directly 
related to an automated concept. 

Optical character recognition is an obvious approach and several 
machines are available. 

The principal commercial use of OCR has been the high-volume pro- 
cessing of less than page- size documents such as credit card and charge 
slips. Page readers are more costly. Nevertheless, there is increasing 
use of OCR for publishing. P. L. Andersson estimates that if the type 
font can be controlled, the overall cost of reading OCR scannable copy 
into a typesetting system can range from 1/4 to 3/4 the cost of reading 
perforated paper type. Andersson is optimistic about future applications. 
He adds: 



On the basis of the success to date of these installa- 
tions /OCR page scanners7 and on the continuing improvements 
in technology as well a*s rapidly decreasing costs, application 
for OCR in the graphic arts can today be considered to be in 
the same position as typesetting computers enjoyed in 1963 when 
there were only three such installations. I believe that we can 
look forward to a rapidly increasing machine population. . .new 
pattern recognition techniques, new direct optical computation 
techniques and a growing appreciation of the need for editing 
and correction capabilities will aid in continuing this growth. 5 

Andersson adds that "OCR must be regarded as the only feasible 
electronics system for releasing the graphic arts from its enslavement 
by the fumbling fingers of the keyboard operator." 

Reformulation of the System 

A private study produced for the printing industry by Battelle 
Research Institute was reported in a 1970 conference known as "Comprint 90" 
The report and the conference looked twenty years ahead. In a summary and 
commentary, a trade journal editor stated: 



5 Andersson, P. L. "Optical Character Recognition for Typesetting," 
in Lowell H. Hattery and George P. Bush, editors, Technological 
Chanqe in Printing and Publishing Management. Washington, D. C.: 
SpartanTfooks, Inc. , in press. 
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j Half an industry and half a communications system, 

the printing industry will be the first manufacturers to 
! feel the consequences of a society dominated by information 

j rather than commodities. As the Battelle study put it;, 

"The printing industry will be the world’s first to use 
| the computer as its principal means of production." In 

| this central role, the computer will replace the printing 

! press. 6 

I The versatility of computer production of packaged information is 

1 also noted by Guy Farrell! "Computerized information can be used to create 

■ a book, magazine, microform, facsimile newsletter, or video-cassette pro- 

gram with only variations in procedures. The possibilities for creating 
different products from the information resources available to diversified 
I publishers are virtually limitless." 

\ This flexibility serves too the increasing demand for output of 

i publications in limited numbers of copies -- the "short-run" print-out. 

\ 

J Publishers are in fact developing a new perspective about their 

| role in an information system for a "knowledge-oriented society," according 

1 to Paul D. Doebler, editor of Book Production Industry . 7 Doebler points 

out that publishers have begun to see that "publishing is not simply the 
i business of producing books or magazines -- or for that matter, radio and 

? television programs. It is, rather, the business of moving information 

from those who have it to those who need or want it, by whatever means 

apply." Doebler observes further that, "while information science developed 
[ from a library orientation, much of its essence is basically publishing 

r rather than storage and retrieval." This concept is reflected in a new 

publishers' trade association in the United States which is incorporated 
as the Information Industry Association. 

Electronic Editing 

Thelfacility for editing text for publication on a cathode ray tube 
display is another important element in the publishing system. One editing 
system, for example, displays 50 lines at a time. Corrections can be made 
through a keyboard and immediately appear on the screen. Lines are auto- 
matically justified. An approval signal dispatches the copy for typesetting. 



6 Farrell, G., "The Larger Meaning of Comprint 90," Book Production 
Industry , January 197i, p. 32. 

7 Doebler, P. D., "Publication and Distribution of Information" in Carlos A. 
Cuadra, editor, Annual Review of Information Science and Technology , 

Vol. 5. Chicago, IL: Encyclopaedia Britarmica, Tnc. , 1970, 2*3. 
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One of the most exciting experiments in bringing the author closer 
to the machine and therefore to output has been carried out by Jerry A. 
Carlson, managing editor of the Farm Journal , a large -circulation magazine. 
Carlson himself points out a long Tiistory or fruitless efforts to persuade 
writers to compose on Monotype machines and Teletypesetter keyboards. How- 
ever, the editing possibility on these machines was negligible. 

Mr. Carlson first developed a research project with Iowa State 
University, Within the Journalism Department a "scientific study of video- 
tube writing" was conducted. He reported that the study "established that 
a useful number of our editors would and could write and edit articles with 
on-line video terminals." 

Although there are several ongoing examples of on-line, video screen 
composing and editing, the process is still in early stages of development. 
Carlson analyzes its usefulness for three writing situations: 

1. The "quick-and-rough" writer under pressure will key his story 
into a video terminal, a rewrite editor will call it up onto his terminal, 
correct it with light pen and keyboard and send it to the computer. The 
article "never saw manuscript paper, paper tape, red pencils, green pencils, 
or blue pencils." 9 

2. The creative author, such as the book author, will key in his 
text. "He can just close his eyes and type, never worrying about carriage 
returns, jammed keys or typos. He simply empties his heart into an 
electronic heart miles away... Next morning, the author can scroll up 
through his previously written thoughts, correct the typos, repair his 
transitions and perform other insertions and deletions." 

3. The author who must develop highly polished or concise copy 

"will insert, delete and revise a great deal as he goes along," according 
to Carlson. One of Carlson’s editors commented: "The words become almost 

physical, plastic -- you can hammer and pound and sculpture them into 
shape . " 



Carlson summarizes his view of video editing: "... the new tech- 
nology offers writers and artists wide opportunity for greater aesthetic 
and intellectual achievement." 



8 Carlson, J. A., "Video Editing" in Hattery and Bush, editors. 
In press. 

8 Carlson, 0 £. cit. 
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Output 



Many significant refinements have occurred in the press room. They 
include the introduction of the belt press, automatic quality controls and 
automatic process control. On the other hand, these changes are evolu- 
tionary rather than revolutionary. 

Revolutionary changes in output are called for by the nature of 
changes in the total printing -publishing technological, managerial and 
distributive system. The possibility of storing encyclopaedic information 
in computer readable form facilitates "on order” publishing. 

Numbers of copies of output may range from one to a very large 
printing. In fact, the output may be "published” on a video screen. 

Producing the output on film in a photocompositor or from a 
computer- output- on microfilm (COM) device, provides a flexible base to 
produce either film or hard copy output in size ranges and varying 
numbers of copies. 

Management of Printing - Publishing Technology 

Although the problems of the new technological systems at the oper- 
ational level are indeed challenging, it is at upper levels of planning and 
entrepreneurship that mind- stretching and vigorous initiative are called 
for. 



That printing- publishing is a complex process has been noted. That 
the steps in the process are increasingly interdependent rather than dis- 
crete has also been noted. However, the role of information science planner 
and manager has been ignored except inf erentially . 

Yet the planning-managerial implications of mechanization and auto- 
mation are most significant. They call for a new total concept which should 
be the context of all managerial and planning action. 10 

Centralization of planning and managing is essential, even though 
operational decentralization is possible and for some processes supported 
by the new technology. 



For a case study of broad managerial concept which incorporates computer 
composition and other advanced technology, see the doctoral dissertation: 
Bracken, M. C., "An Analysis of the Evolution of the National Library of 
Medicine: Implications for the Development of Scientific and Technical 

Information Networks," The American University, Washington, D. C., 1971. 
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Whether centralized concern with the total process of creating text, 
processing text, "typesetting" or composing, "printing" and distributing is 
with a business organization, a professional organization, or a governmental 
organization does not modify its necessity. 

In the United States, industrial mergers reflect these forces of uni- 
fication and centralization. Many information scientists and policy 
officials have been slower to recognize the inevitability and significance 
of organizational response to printing- publishing technological change. 

The potential contribution to society of balanced support for 
elements in the process is very great. The "information problem" can be 
brought under control with present and emerging technology if policy 
planners and managerial statesmen turn attention to it and extend support. 

Research and the Information Specialist 

Comparatively small investment has been made in research and develop- 
ment in the printing-publishing process. Most fundamentally significant 
innovations are "fall-outs" or "spin-offs" from research for other end 
objectives. The small direct investment has generally been applied to very 
specific objectives which have localized significance but tend to be trivial 
in relation to the total system. 

Similar criticism can be levied against planning and support of 
research and development for science information. 

A larger strategy for research and development and larger investment 
of resources is needed critically if technology is to be utilized anywhere 
near its potential. 

Information specialists can be especially helpful in: (1) planning, 

developing and operating information banks; (2) assessing in advance the 
needs of users; and (3) providing specifications for outputs from the system. 
In order to carry out these functions, the information specialist must think 
and operate in relation to the new technology of printing-publishing as an 
interrelated system. 
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MR t E % OFFENBACRER (Ieixiel): Reprography and information centers are well known sub- 

jects; however it is desirable to note what is special about the needs of developing 
countries in this field. In developed countries there is a steady flow of incoming 
requests for information to the information center. In a developing country, or as it 
i 8 more adequately called, in a young country, it is necessary to stimulate the use of 
the information center. The information needs, therefore, are quite different from 
those of industrialized countries. The material to be published and distributed by 
the information center is also different, and so are the reprographic needs and 
possibilities. 

Because of the rather difficult economic and administrative situation, the trame- 
work in which information centers have to work in a young country is quite different 
as far as the technical means are concerned. The famous slogan * what is good for 
America is good for you - is true only for industrialized nations. This is not be- 
cause techniques developed in America, England, Russia or any of the big industrial 
countries haven’t proved themselves, but rather because the economics of their uae is 
quite different in a young country. 

I would like to give a few examples of the sort of economic considerations I am 
referring to. To use micromaterial you need readers. There exist reasonably priced 
readers. For instance, there is an American product which sells in the US for $80 
which is a very reasonable price. Translated into Israeli pounds, this is about 
IL.350. Add insurance, shipping costs and taxes and it comes to IL 650. You can buy 
approximately the same things in Israel with an Israeli pound as in the States with a 
dollar. So 650 Israeli pounds would mean in America $650. In the States $650 is a 
considerable price for a microfiche reader and it would severely limit the possibility 
of popularizing the microfiche system. This is a typical problem of developing 
countries. 

Another example is the calculation to be made to find out if a certain piece of 
equipment is worth acquiring. The depreciation of reprographic equipment is calculated 
at about five years, but in practice only three years are considered realistic. After 
three years, there will be a new model on the market or even a new technique which will 
make It desirable to acquire new equipment even if the old equipment is still usable. 
After three years in an industrialized country, such as Germany, England, Holland or 
Sweden, the machine will have amortized itself. In a developing country, this is 
doubtful. Then there is the choice of throwing away a machine which has not yet amor* 
tlzed itself or of foregoing the benefit of an improved model. The question of invest- 
ments and other economic considerations is thus quite different in a young country. 

There is another falacy which I would like to mention here. We often hear that 
(t any office girl can do it* 11 Any office girl can indeed push a button, but to be sure 
that the machine always responds when the office girl pushes the button, we must have 
highly skilled technicians who can maintain and repair the machine. This again is a 
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problem in young countries. Being situated far from centers of development in Europe 
and the US they don’t always have experienced technicians and this makes it impossible 
to use the same sophisticated machinery which works beautifully in the more developed 
countries. On the other hand, there are prestige motivations to acquire sophisticated 
equipment which can’t be profitably used in a developing country, 

MR* M* DORON (Israel) t There are two reasons for computer-aided typesetting of the 
Encyclopedia Judaica. The major reason is the shortage of trained compositors in 
Israel. We would have needed for the Encyclopedia around 20 trained compositors which 
is out of the question. You just can’t find that many people on the labour market. 

We therefore set type on f lexowriters, which require very little knowledge on the 
part of the typist, A compositor takes at least a year or two to train. Our typists 
were working after one week of training. 

Rather than go through a complicated mechanical procedure, we simplified a lot of 
operations; for example, every time we wanted to set italic type, we simply had the 
typist 8 trike a dead key which doesn’t move the carriage along, in the form of a 
box. This punched a unique code on to the paper tape. After that for italic, she 
punched the letter "i". So |i| preceding a group of words or even pages would cause 
everything that comes after it to be set in italics. 

This also simplified the preparation of the index volume. The index editors, ra- 
ther than waiting until the whole text had been set and made up and then going over it 
and deciding which words should be in the index, took the manuscript and Indexed that. 
In other words, if there was a certain word, for instance "Moses", which we wanted to 
index, the word would be marked on the manuscript in front of the word Moses |T) flj , 
The capital (Tj means that this is a word to appear in the index. The 'T| means that 
this is the first word in this particular article that is to be indexed. This is for 
later accessing. These codes did not appear in the proofs of the text itself but were 
in the margins for checking that the computer accepted them. 

The indexer, at the same time as he marked up the manuscript, also prepared an 

index card. On the card he wrote "Moses", the article number, and the number of this 

particular word in the article that was to be indexed. He then manually sorted the 
cards in alphabetical order. He also edited the cards if necessary. "Moses" may 
not be a main item but a sub-item to some other main item. The cards were edited 
constantly until all the articles were included. Then they were put on paper tape on 

the flexowriter, and set with the word "Moses", the article number and the index num- 

ber, Since wo did not use automatic makeup on the text itself, we had to inform the 
computer exacLly what article appeared on which page. The computer, as soon as it 
knew the article and the index number where a particular word appeared in an article, 
could automatically convert the article number into actual volume and page number. 

This is then printed out on our Photon 713, automatically made up into pages, and 
after corrections, is ready for printing. 

We used the computer for several additional purposes, e,g. sorting of articles. 

We didn’t set the articles in alphabetical order. The computer sorted them out and 
every time we were ready to close a certain article we could sort it out very simply. 
Anyone who has worked at a printing house would know that this wcvdld have been impos- 
sible to do without the computer. 

In addition, the computer justified the lim*s and the typist did not have to do 
this at all. We used automatic hyphenation which only experienced compositors could 
do. Corrections were handled by updating the computer memory. Make-up of the index 
was also done automatically, a time consuming operation in any printing house. 
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All these things may not have saved money on this project, but undoubtedly saved 
lots of time. The index volume will appear two weeks after the last volume is printed. 
The whole encyclopedia will be ready by the end of the year, 

PROF. L,H. HATTERY (US); My principal objective is to try to generalize for the in- 
formation scientist the impact of the evolution, and to some degree, the revolution, 
of the technology of reprography and of printing. My thesis is that the processes of 
printing, of publishing, of librarianship and of information science are merging into 
a single complex process. It has been interesting to note that in very few meetings 
of information scientists has the topic of printing been identified as a major topic 
of concern, and it seems to me significant that at this point in time, at this Confe- 
rence in Israel, that it should be identified as a major program topic. In our curri- 
culum in information science at American University, we have had now for about three 
years a separate course on the automation of the printing process. 

It is perhaps interesting to note that despite the general lack of attention by 
information scientists in professional meetings and professional journals to the tech- 
nological evolution in typesetting, nevertheless, the primary impetus for the develop- 
ment of automatic, electronic typesetting did come from information scientists. And 
I think perhaps If one were to write the technology history, the information scientists 
at the National Library of Medicine would deserve a large amount of credit. 

The papers presented by ray colleagues hare have been very impressive and have indi- 
cated the degree to which the advanced and sophisticated technology of reprography and 
printing is already operational. In the States, there are more than 1400 computer 
typesetting installations. More than one-third of the newspapers are being composed 
by computer. Although there has been a great deal of discussion for a number of years 
about other technological developments such as the use of optical character recognition, 
there has not been much operation, so we are noraewhat inclined to discount the likeli- 
hood of these technologies becoming significant in the near future, I suggest that 
the technology and the economics of optical character recognition for the automatic in- 
put of information into composition systems, for example, is not likely to be a signi- 
ficant development within the next five years. 

The important issue is to try to identify the useful and necessary implications 
of these developments for us as information scientists. One is that we must be pre- 
pared to accept lower quality output. There are examples of high quality output which 
suggest that it is possible with the new technologies to produce high quality material. 
Nevertheless, this is at the cost of more time, and a great deal of effort. There are 
some who are upset when they see justified right hand margins, and some when they see 
ragged right hand margins, because it seems that the esthetics of justifying right 
hand margins may not be economically justified within the practical problems of commu- 
nication, I would suggest, at the risk of horrifying some of my colleagues, that 
hyphenless composition may be acceptable in many fields. Some of the newspapers in 
the States converted to hyphenless composition and very few of their readers noted 
this conversion. I think the same thing may be true for the readers of scientific 
and technical information. 

Another implication of technological innovations is the expectation of highly 
mixed media for any complex - hard copy, microforms in various aspects, tape displays. 
This has grave implications for our forward planning. 

Let me suggest several of the specific problems that we ought to be taking into 
consideration in forward planning. There has been some reference to the matter of 
building to hold new collections of books. One of my battles Is against library 
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architects. In the information sciences we who need to build a new building must 
depend on architects who have been trained to build a building to hold books only. 

Even within ray own university, the principal concern and goal of our university li- 
brarian has been to build a large new library to hold more volumes. It seems that, 
the investment ought not to be in physical facilities for traditional hard copies 
but rather for the mixed media which are already available and which can be made 
available to substitute for hard copies, 

' A second concern in the area of planning is the establishment of networks. Those 
who are engaged in planning networks beginning with UNISIST through to component net- 
works, need to have greater input from information engineers to devise standards for 
media mixes which are optimal for large networks. It seems to me that because most 
of the documentation that relates to network planning does not take technology ade- 
quately into account, plans will prove obsolete within five to ten years. 

Finally I suggest that the developments that I have been referring to in technology 
and their applications to major forward planning in terms of facilities, equipment, 
personnel and networks, call for a substantial degree of centralization of planning 
and support. In the US and other countries, governments and professional associations 
have not given adequate attention to financial support for the kind of planning and 
preparations necessary for the effective and optimal use of the new technology, 

DR, M. CREMER (Germany) : I would like to mention two possible applications of raicro- 

fiche> The first is in the distribution of doctoral theses. In my country, we have 
the problem that, on the one hand, the students and the faculty do not want to shoul- 
der the financial burden of printing theses in conventional form. On the other hand, 
the university libraries are very keen to get copies of theses and dissertations for 
exchange purposes. We started to develop a national program to put doctoral theses 
on microfiche and I hope to convince my library colleagues to accept this program. 

The second application relates to archives and office records. Large offices 
like governmental departments, archives, big scientific organizations, etc, put their 
records on microform not only to save space, but also to have access to them through 
a combination of microform and computer search, thus making the records of the office 
or of the archives available as a kind of integrated information service. 

MR, OFFENBACHER : To Dr, Cremer • I have found that new Ph.D.’s who Just presented 
their theses are rather reluctant to make them available before they have appeared 
in the press. Only a few copies are required for the university authorities. The 
question is if microfiche is economical under these circumstances, since I doubt 
that professors would be prepared to read the microfiche in order to give their 
opinion about the thesis. 

MR, D f 0LIER : In France we have a program for all French speaking countries, many of 

which are developing countries, so the question of price of micro -documents is of 
paramount importance to us, 

MR. P. TURA (Ierae l) : To Mr. Thiele and Mr. Offenbaoher. Because a small country 
can’t afford to buy a lot of sophisticated and expensive equipment, the manufacturers 
and designers of this equipment should get to work. Microequipment for the viewing 
of microfilm and microfiche could be produced at a cost not exceeding by much that of 
a pair of good spectacles. I have seen some readers for viewing film slides for 
tourists and possibly this or a stereoscopic viewer might be adapted for reading 
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microfilm, at low cost. This would allow every scientist to buy a cheap microfilm- 
microfiche reader which would operate in daylight or on batteries. Microfilm-micro- 
fiche can also be viewed with a slide projecter, 

MR, G, THIELE ( Germany) : To Mr. Turn , A manufacturer can make cheaper readers if he 
can make them in large quantities. This is only possible if he is sure of a large 
market. If there are various microfiche sizes you have to have various types of 
equipment. In Frankfurt we have an automatic enlarger for microfiche, 

MR, OFFENBACHER: To Mr, Tura • I am afraid what works well for a tourist looking at 
slides wouldn’t work as a reader for microcopies. There are too many details to be 
read which we don’t look for in a snapshot, 

MR. V, PUMP I AN (Israel) : To Mr, Thiele, Could you give us some more details about the 
plan to publish abstract periodicals? Is it done in co-operation with publishers? It 
would be useful to shorten the period between the time data is collected and the time 
it is published. 

Mg, THIELE : To Mr. Pvmpian, With the periodicals there is some trouble, mainly the 
problem of copyright. When there will be some agreement on copyright I think there 
will be agreement between the producers and the users, 

MR, L, PAPIER ( VS) : Research is being carried out in the United States on the accepta- 

bility and effectiveness of microfiche. One study has been completed at the University 
of Denver on the use of microfiche. We have a big project with the Am. Assoc, of 
Junior Colleges in a learning situation with the whole course produced on microfiche, 
highly controlled where variables are determined. These variables are things like 
whether it is data or information, positive or negative film, type of reader, etc. I 
just want to bring out that we do have a little more hard data on microfiche, micro- 
form than we had in the past, 

MRS, S , WEIL (Israel) : Users of microfilm or microfiche use microform collections for 

eliminating not needed material. If they find a reference or if the SDI service brings 
to their attention a document which exists only on microfiche they come to our library 
and read only the abstract or the main points. If the document is of interest, they 
ask for a full copy. The microforms are more for the archives than for the user, I 
would like to know if my colleagues in other countries have the same experience or if 
their users are really reading the microfiche and using them at home with a reader. 

MR , Y, SHALEV (Israel): In the documentation unit at the Ministry of Defence we 

recently distributed some 35 readers to our users. The price of each reader is 
$60 - $70 and is paid off in a year If only 35 reports of MPIS are read per annum, 
as the microcopy of the report costs only 95 cents. Another economic point which 
should also be considered is that storage space in the Tel Aviv area is IL 1,000 
per sq./m. per year, 

MR , OFFENBACHER : To Mrs. Wei U You are quite right. Microcopies were never intended 
to be read but only to be scanned. When it becomes apparent that a whole publication 
is to be read, the text should be enlarged, 

MR, THIELE : To Mrs . Weil . I agree that it is impossible to read a whole microfiche 
of 60 pages. We can make copies in about 3 minutes, and the price for these 60 pages 
is 12 Marks, 
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If a publisher produces a printed periodical and a microfiche, in Germany he re- 
ceives the same price for both. In the case of reports, there is only the microfiche 
as the manuscript is used only for filming. 

MRS* WEIL: To Mr, Thiele. It seems very important to have the possibility to acquire 
journals both in the form of microfiche and the full size copy. For browsing, readers 
need the latter, whereas for back searching the microfiche is very suitable* If we 
had a combination price that would allow us to buy both at a reduced price, we would 
save storage space as we could discard full size back volumes. 

MR. U. BLOCH (Israel) : I don't see any reason why someone who wants to buy the full 

sized journal and the microfiche should pay the editor and the author twice. As 
Mrs. Weil said, we need the hard copy because people want to browse and we need the 
microfiche to save storage space. Why should the microfiche not be available at the 
actual cost of preparing it which would come to something like ten per cent above the 
price of the hard copy. This would enable librarians to start working in a new direction. 

To buy the film later is expensive and if the microfilm is prepared not from the 
original its quality is entirely different. If there are any representatives of pub- 
lishers present, they should take this discussion into consideration and think about 
offering the microforms at a reasonable price. 

In addition, I'd like to know whether the speakers have any idea of what will be 
the next step in the development of faster typesetting machines. Perhaps on-line 
displays photographed directly to the film, and then shipped to the printer might be 
the aext step, 

MR. P; SIMMONS ( UK ) ; To Mr. Bloch. New composing machines come out every day. Speeds 
of something like 10,000 characters a second have been recorded. People have problems 
with powerful computers keeping up with them, I can give you the names of these ma- 
chines but they are probably meaningless. Speed at the composing end of the system is 
not the problem. Time span from the acquisition of the material to its actual publi- 
cation is very much longer than the two or three days of work on the 713, This could 
obviously be reduced to a few hours but with a time schedule of two or three months, 
it makes little difference. 

DR. S. SCHWARZ (Sweden) : To Mr. Zuohel. The question of transformation of text to 
magnetic tape is one of time saving. You have a system where four-punches represent 
one character. The Japanese have developed a system which enables them to represent 
about 2,500 characters with two-key punching which is done not sequentially but simul- 
taneously, This punching seems to go with great speed. Have you a comparative economy 
of these different systems? 

MR. SIMMONS : To Dr. Schwarz. We have a similar problem with some 700 different charac- 
ters we need to encode. We have adopted the technique of taking one character followed 
by a shift number of which there are six, and following that with a specific character. 
For example, we would take a character which corresponds directly with the paragraph 
sign and follow it with "1" for the Greek alphabet, and "a" thereafter would indicate 
alpha and so on. 

MR. R. ZVCHEL (Gemcmy) : To Dr. Sahwarz. A system of prototypes which we use is open 
ended; the only thing you have to do is cover the frames of the composition machines 
with the systematic tables for Japanese characters, for example. You can put in these 
tables what you want; all characters are possible. 
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DR. SCHWARZ : To Mr. Z uohel. It was a question of timing • Does it go as quickly as 
punching two keys simultaneously? 

MR* Z UCHEL : To Dr \ Schuarz. Yes, it is a question of timing, but timing is not in- 
teresting in this case. The problem is to record all the data without loss of informa- 
tion* We have solved this with this system* You do need more time for four represen- 
tatives of one character. 

MR* L. ELPERN (Israel) : To Mr. Doron • Are you considering the possibilities of making 
any form of micro-reproduction of the encyclopedia? Is the body of knowledge accumu- 
lated in this encyclopedia going to be updated in some way? Are you considering any 
kind of secondary publications from the accumulated knowledge? 

MR. DORON : To Mr. Elpem • No microphotography is anticipated as yet, but we will con- 
sider your idea. As for future updating, that of course is one of the prime reasons 
that we use the computer* Future editions of the Encyclopedia will be much easier 
and quicker to update than would have been ordinarily possible* Updating within a 
few years is anticipated* The third point - using the accumulated knowledge in other 
forms, such as separate books or other publishing ventures, is also being planned* We 
plan many side products to the Encyclopedia* A huge amount of knowledge has been 
gathered and since it is computerized, it is readily accessible* 
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THE INTERNATIONAL NETWORK OF ISI® -LINKED NATIONAL INFORMATION CENTERS 

Eugene Garfield, Ph.D. and Morton V. Malin, Ph.D. 

Institute for Scientific Information, Philadelphia, pa., u.S.A. 



SYNOPSIS 

During the past six years The Institute for Scientific information 
has, through private investment, constructed a multidisciplinary 
machine -language data base covering nearly two million source arti- 
cles. This file is increasing at present by approximately 400,000 
items per year or 8,000 per week, use of ISI*s information services 
offers a method of establishing an effective national information 
system at a cost acceptable to smaller and developing nations. 



Customarily, discussions of information networks are concerned with the concept 
of interrelating the activities of information centers on a national or international 
scale. Only slight, if any, attention is given to the already existing international 
network for scientific information exchange which is the scientific community itself. 
Scientists have disseminated information and communicated with one another using a 
variety of methods, but primarily through the published literature. (journals) , cor- 
respondence, scientific meetings and more recently telephone and other modem media 
of communication, for several hundred years. 

It is understandable that discussions of information networks do not become 
involved with these first order networks but concern themselvos with, what I shall 
call, second and third order networks. The second order networks are those intended 
to establish a system of information interchange on an international level, and the 
third order networks are those which involve systems for linking national information 
centers. The concern with second and third order networks is understandable because 
the processors and disseminators of information are faced with the problem of manag- 
ing the information flow that results from the research and scholarly activities of 
the scientific community. The concept of network systems, international or national, 
reflects the desire to make the output of the scientific community more readily avail- 
able to a greater number of potential users of information, and to do this more ef- 
ficiently by establishing methods which will reduce duplication of effort. These 
objectives are important because all of us are aware of the duplication of effort 
that, exists and of the high price that is paid as a result. 

Too often# however, the demand for efficiency translates itself not into making 
better use of that which already exists or improving the operation of existing 
systems, but into developing new and different methods and systems. 
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ISI *3 approach. to establishment of a network of internationally- linked infor- 
mation centers has been to develop better methods for dealing with the communication 
system that already exists among scientists and to expand and nourish it* This is 
done not only by computerized services , vAiich is the main focus of my talk, but by a 
variety of services, both manual and machine oriented, that play a significant part 
in isi's role as a processor and disseminator of information at all levels — indi- 
vidual, institutional, national, and international* 

Before considering the relationship of isi to the information centers located 
in more than a half dozen countries in which ISI tapes are used, it is important to 
describe isi *3 approach to providing the totality of its services to individual 
countries Tn their efforts to establish national information centers which can link 
up to the ISI network* 

151 *8 services can be characterized both functionally and by subject area. The 
first category includes: 

1* Current Awareness Services 

2* Retrospective Searching service 

3* Selective Dissemination of Information Service 

4. Library Service* 

Although the subject areas covered in ISI services include every field of scholarship, 
I shall delineate these areas broadly as follows: 

1* Life Sciences 

2* Physical Sciences 

3* Chemical Sciences 

4. Agricultural Sciences 

5* Engineering and Technology 

6* Behavioral and Social Sciences 

7* Education. 

I mention these subject areas only to indicate the broad coverage available through 
ISI's services* More important, I believe, is the description of the types of 
services offered as categorized by function, although I shall at a later point talk 
in some detuil about ISI *s chemical information service* 

Using as input approximately 5000 journals from almost every country in the 
world, current awareness service is provided through the medium of five Current 
Contents® editions* These are? 
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1* Current Contents/I*ife Sciences 

2. Current Con ten ts/Phys leal fc Chemical Sciences 

3. Current Contenta/Agricultural, Food, L Veterinary Sciences 

4. Current Con ten f /Engineering S Technology 

5. Current Contents /Behavioral , Social , 6 Educational Science a 

The purpose of Current Contents is to provide prompt current awareness of research 
results published in the leading journals of the world in the broad areas shown 
above. Each week the contents pages from the journal issues received at ISI are 
reproduced in one of the five Current Contents editions. Each Current Contents also 
includes an author address directory which readers can use to send to "autKorsTfor 
reprints of articles they have found through Current Contents . It has been estimated 
that five million reprint requests per year result from use of Current Contents . An 
important feature of the current awareness service is the fact that ISI has estab- 
lished relationships with publishers throughout the world which enable us to obtain 
journals quickly j the contents pages of many journals are obtained in advance of 
publication* 

Retrospective searching capabilities are provided by ISI through the Science 
Citation Inde^ (SCI®) and Permuterm® Subject Index ( PSI ) and 151 *8 Search Service* 
The SCI and PSI are comprehensive multidisciplinary indexes which provide a variety 
of unique methods for retrospective searching of the scientific, agricultural, tech- 
nological, and biomedical literature. The SCI provides access to the literature by 
author, citation, or organization, and the PSI is a natural- language subject index 
which complements the SCI , providing access to the literature by words taken from 
the titles of current articles processed into the SCI data base. 

The SCI data base is derived from a core of approximately 2500 of the 5000 
journals received at isi . Each journal issue is indexed into the system from cover 
to cover and all substantive items are processed. The SCI and PSI are published 
quarterly with annual cumulations. In 1971, the data base will consist of approx- 
imately 400,000 current articles and 4,000,000 citations. This year will also see 
the publication of a cumulative SCI covering the years 1965-1969. The total SCI 
file now includes over 1,500,000 source items and 20,000,000 cited references. 

Retrospective searching is also provided through the ISI Search Service *4)1 ch 
is used by many subscribers when they require special literature searches to be 
performed. 

SDI is available from ISI either through ASCA® IV, the Automatic Subject Citation 
Alert service operating at ISI *s headquarters in Philadelphia, or through lease of 
the Science Citation Index tapes which can be used by organizations to provide SDI 
to their staff. The data base used for ASCA , or available from ISI tapes, is the 
same one which is used for the SCI . SDI services, however, are provided weekly 
through subscription to ASCA or, if tapes are used, these are also supplied weekly 
to the lessee. The ASCA service is unique in the fact that citation questions, as 
well as words, can be used to develop the user profiles* Tape users also have this 
feature available to them if they wish to obtain the citation tapes as well as the 
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source tapeu, and very important is the fact that the ASCA software can be bought or 
leased from ISI . 

Among the functional service areas mentioned previously was I ST 1 s library 
service. This service called OATS®, Original Article Tear Sheet service, was 
designed to provide users of ISI 1 g system with the means of obtaining hard copy of 
any article which their use of any of the current awareness, retrospective searching 
or SDI services uncovered, when such articles are not readily available through reg- 
ular library channels. In fact, In many cases the OATS service Is used by libraries 
and Individuals even If the journals ure available in their libraries because of the 
convenience and the speed with which they receive tear sheots and because, on a cost 
benefit basis, the sorvlce Is relatively inexpensive. Some ASCA customers have 
requested that articles reported cn th^ir printouts be sent to them automatically* 
we call this service ASCAMATIC. 

I have only, In the preceding, described very briefly the four major service 
functions performed by ISI %Ailch ISI feels are solutions to the four basic aspects 
of the problem of disseminating scientific and technical Information. These solu- 
tions are Keeping users of information current on scientific and technological devel- 
opments! Current Contents , Keeping them informed on a selective basis of develop- 
ments having direct bearing on their worK* ASCA and Tapes, enabling them to learn 
qulcKly what has been published In the past* SCI , retrospective searching capability* 
OATS , enabling them to retrieve documents qulcKly and efficiently. 

I stated previously that ISI has a specialized chemical Information service 
which I shall describe briefly before commenting upon the ISI -llnXcd international 
Information networK. 

ISI * s chemical information services group consists of the publication Current 
Abstracts of Chemistry and Index Chemlcus™ ( CAC&IC 1 ^ , the Index Chemlcus Registry 
System^ , and the Chemical substructure^ Index 7 * 1 . Current Abstracts of Chemistry and 
Index Chemlcus Is a weekly abstract journal of the chemical literature which places 
special emphasis on reporting new compounds and relations. In addition to the tra- 
ditional abstract, structural diagrams and reaction flow diagrams are also provided. 
Indexes to the wceKly Issues aro published monthly and cumulated annually. Current 
Abstracts of Chemistry and Index Chemlcus Is tho source for preparing the Index 
Chemlcus Registry System . The compounds reports in CAC&IC are encoded into 
Wiswesser Line Notations, which pormitB preparation ol magnetic tapos that are 
computer searchable. Searching can be by compound family, substructure, biological 
activity, authors, journals or Index terms since tho tapes contain not only the 
encoded compounds, but also bibliographic Information for the article in which the 
compounds were reported and Information regarding tho other searchable items men- 
tioned. Tapes aro provided monthly and programs are available for use both for 
retrospective and current awareness searches of the file. 

The Chemical Substructure Index is dorived from the I CHS® and Is a published 
permuted listing of tho compounds encoded into WLN. This Index provides tho capa- 
bility of doing substructure searches for tho most rocontly reported chomical 
compounds by manual means. It Is published on a monthly basis and will bo cumulated 
annually. 

My purpose In providing this brief ovorvlow of I SI sorvlces Is to lay the basis 
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for describing how ISI serves as a mechanism for linking information centers and 
explaining how ISI 's services can be obtained on a national level to help in 
development of national information systems. 

1S1 is a supplier of its SCI data base to national centers in five countries 
for SDI uso and also supplies these tapes to approximately half a dozen private 
organizations which use them for internal SDI services. The ICR S tapes are 
presently supplied to a number of private chemical firms but are also available to 
national centers. 

In making these tapes available on a lease basis, I SI has, in fact, already 
aided in tho establishment of an international network of information centers in 
which ISI , as the processor of information produced on an international scale, 
servicos as a distributor of reprocessed information to an international clientele. 

Tho present system is not a formalized one with rules, by-laws, standards, etc. But, 
there is no reason why the process cannot become more formal. In fact, since ISI 
is prepared not only to lease its tapes but also to lease or sell its whole ASCA 
software system and to help in its installation and guarantee its operation to any 
country, many of the obstacles which inhibit establishment of networks are eliminated. 
There exists at this very moment the capability for every country represented at this 
meoting to have, without the need for excessive development costs, a national SDI 
system, simply by obtaining from ISI the rights to the ASCA system. 

Even more important, the whole ISI system consisting of the four components, 
current awareness, SDI, retrospective searching capability and library service, is 
also available because ISI is prepared and willing to cooperate with national in- 
formation systems to supply all its services and publications to a central authority 
for redistribution to the scientific, engineering, technological, educational, and 
managerial communities in each individual country. 

Such an arrangement has already been negotiated with the Ministry of Education 
in Spain, negotiations are now in progress in several Latin American countries for 
obtaining all ISI *s servicos on the national level. Negotiations have been com- 
pleted with a representative of the Japanese Government for use of the ASCA and ICRS 
system on a national levol in that country. I am also sure you know that ISI tapes 
are used by the National Research Council in Canada, the Royal Technical Institute 
in Sweden, and by COSTI hore in Israel. 

In its arrangements for sorvico on a national scale as, for example Spain, ISI 
provides to tho central authority or national system bulk shipments of the five 
editions of Current Contents and Current Abstracts of Chemistry and Index Chemicus , 
sets of tho SCI for placement in universities, research institutes and other libraries, 
providos OATS sorvico through tho national center, and Search Service on demand. 

We aid in the establishment of a national SDI system through a phasing-in process 
by supplying ASCA from Philadelphia until tho computerized system is installed locally 
and by providing technical and educational service. Those services aro designed to 
guarantee the operation of tho system and to train both tho administrators and users 
of tho system in tho techniques of profile preparation and refinement, and in man- 
aging tho systom, I wish to stross very strongly that ISI takes its responsibilities 
in such cooperation very seriously because wo insist on providing training to local 
personnel on tho operations, administration and uso of our system. The management 
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problems of building a system are well known to all of you and the literature is 
full of articles reporting on these problems; IS ** 8 objective is to eliminate them. 

IS1 stresses that it will provide complete administrative and management 
support in connection with the supply of its services. This support includes 
advising on the type of organizational structure that should be established for 
the system and in providing training for clerical and technical personnel itoo will 
be responsible for operation of the system. This training will include instruction 
in record keeping, acquisitions procedures, and methods for establishing distribution 
procedures for the published and computer-generated services, and for establishing 
financial and administrative procedures to insure efficient operation of the system. 

In regard to the ASCA system, ISI provides as part of the software lease full 
documentation of the programs. Its computer systems personnel will help install 
the programs at the local computer center, and train operators and systems and 
programming people in its use. ISI will also provide training on profile prepara- 
tion and profile input processing at the local center and to personnel sent to 
Philadelphia. Finally, I should stress that ISI is prepared, as part of its training 
responsibility, to develop instructional materials on all its services in the local 
language, 

I shall conclude my talk by saying that it is universally recognized that the 
future growth of any country in the world today depends on developing a strong 
educational and research and development capability. Development of this capability 
is a major requisite for establishing the base for future economic growth and solving 
the pressing social problems faced by all nations. A major contributor to the solu- 
tion of these problems will be properly controlled and disseminated information to 
the individuals who are engaged in working toward the solutions. The cost, to any 
nation, of solving the information problem by itself is very high and in many cases 
actually prohibitive. The answer, consequently, is to take advantage of resources 
already available as well as to develop systems to meet specific individual needs. 
Recognizably, no country wishes to remain completely dependent on outside resources 
indefinitely, but’ also recognizable is the fact that economic necessity and good 
management require that the development of national information systems be done in 
the most efficient manner possible. Exploiting available resources which are cheaper, 
costwise, than development of new similar resources is an important way to keep devel- 
opment costs low. 

I suggest that effective national and international systems and networks can be 
developed by linking into available systems mich as ISI/s. 
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SYNOPSIS 



The National Science Library^ national SDI Service, after two 
years of operation, is proving to be a successful and valuable 
service, Employing six data bases covering 900,000 papers 
per year, it provided in 1970, 52,000 tailor-made bibliographies 
to 1400 users. Steps in the development of the service are sum- 
marized, strengths and weaknesses of the system discussed, and 
plans for future development outlined. 



The Canadian Selective Dissemination of Information Service (CAN/SDI), a na- 
tional computerized information retrieval system, for Canadian scientists and 
technologists, has now been fully operational for two years and three months. 
During this period five papers have been published in the national and interna- 
tional literature dealing with specific aspects of the service - e. g. systems 



design and programming, training of users, and user reaction to the serv- 
ice^’ ' * , This, the sixth paper in the series, is an attempt to provide 

an overview of the CAN/SDI system with particular emphasis on the strengths 
and weaknesses of the service from the point of view of the providers and users, 
and a look at plans for future development. 



For those of you who have read these earlier papers we apologize for what may 
appear to be inexcusable and useless repetition. For those who are not familiar 
with the service, we apologize for the omission of technical details which are 
certainly important but which cannot be dealt with adequately in a paper of this 
nature. For both groups, particularly those who are experimenting with or 
considering a similar service, we trust this summation of our experience will 
prove interesting and useful. 
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The National Science Library of Canada 

The CAN/SDI service is one of many national information services provided by 
the National Science Library (NSL) and to see this program in proper perspective, 
it is necessary to briefly review the role of the NSL as Canada's major agency 
responsible for the dissemination of scientific and technical information (ST1) . 

The NSL, a Division of the National Research Council of Canada (NRC), is not a 
library in the conventional sense of the word but rather an information transferral 
agency. Its total activities are designed to supplement and complement local in- 
formation services and to ensure that Canadian scientists, technologists, indus- 
trialists and managers, have ready access to publications and information required 
in their day to day work. The NSL f s resources, now totalling some 800,000 
volumes, and its information services, have been developed in close cooperation 
with all major libraries in Canada. Through these and related cooperative meas- 
ures, and through the utilization of Telex linkage with world -wide information 
services, the NSL serves as the focal point of a national scientific and technolog- 
ical information network. 

A detailed description of the nature and extent of the NSL* s resources and serv- 
ices may be found in the NSL* s Annual Report and Guide to Resources and Serv- 
ices 1969 -70^, and need not be repeated here. However, to emphasize the rather 
unique role of the NSL, as compared with the information services of similar 
agencies in other countries, two points should be noted: 

1. The NSL's information retrieval and dissemination services and the essential 
backup resources form an integrated administrative unit. 

2. The NSL is the major unit of a national decentralized scientific and technolog- 
ical information system. 

Development of a National STI System 

The NSL had its beginnings in 1925 as the library serving the NRC of Canada. Its 
change from that of a library serving a group of federal laboratories to that of a 
national information centre has come about through a long series of government 
decisions and directives - the change has been evolutionary not revolutionary. 
One of the most important of these decisions was made in December 19^9 when, 
as the result of a series of government studies of science and science information 
policies*’ 2 , the NRC was named the coordinating body responsible for the further 
development of a national STI network. 

This action recognizes the effectiveness of a decentralized information system in 
a country the size of Canada as opposed to a highly centralized system. It also 
represents a crucial point in the development of Canadian information policy, for 
the government directive contains clauses intended to ensure that a national 
system for STI, which is the responsibility of the NRC, and a national system for 
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information in the humanities, social sciences and the arts, which is the respon- 
sibility of the National Library, are completely compatible and developed in 
parallel. A full account of this development may be found in a paper entitled "A 
National STI System for Canada"* 0 . 

CAN/SDI - Basic Principles 

The NSL is continually experimenting with and developing new techniques to facil- 
itate the storage, retrieval and dissemination of information. The CAN/SD1 
system is one of the more successlul of these techniques and is regarded as an 
extension of the Library's Reference and Research Services and as another tool 
to assist researchers obtain maximum use of pertinent literature. 

The CAN/SDI service was inaugurated as a national service in April 1969, after 
three years of testing and experimentation using NRC and other Ottawa scientists 
as guinea pigs. During thiB experimental period three major problems were 
identified as requiring solution before a national service could be provided. 

1. The incompatibility of available data bases or tape services from the point of 
view of file formats. 

2. The development of interest profiles for users remote from Ottawa. 

3. The training of potential users of the system. 

Development of a Standard File Format 

The solution of the first of these problems, incompatibility of data bases, was the 
one requiring urgent attention and the one which provided most difficulties. It 
was obvious that anytime we wished to add a new data base to the system, a new 
search and print phase had to be written * a costly and difficult undertaking. 
Accordingly, after much experimentation, a standard file format, based on the 
MARC format, was designed. The steps leading to the development and adoption 
of this format are described in a paper by J. Heilik^. All data bases used in 
CAN/SDI are converted to this standard format thus requiring only one search 
program for the data bases and one program to print the results of this common 
search. The MARC format is extremely flexible and can handle with ease almost 
any kind of information. 

With the development of this standard format new data bases were rapidly added 
and at present five tape services are being employed - Chemical Titles, Chemical 
Abstracts Condensates, IS1 Source and Citation, 1NSPEC and Biological Abstracts. 
A sixth data base, MEDLARS, is also being used to provide both current-aware- 
ness and retrospective searches, but the programs used for query formulation 
and computer searches are those developed by the U.S. National Library of 
Medicine. At present search requests arc formulated at the NSL, but until the 
MEDLARS II programs are available for use with the NSL’s computer facilities, 
the tapes are being processed at the Texas Medical Center in Houston. 
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These data bases provide users of the service with access to the contents of 
approximately 9000 journals or 900,000 papers per year in all fields of science 
and technology. All cited journals are included in the 16,000 periodicals received 
by the NSL and photocopies of cited papers, not available locally, are provided 
upon request. 

Through the use of an IBM 360 Model 65 computer, interest profiles are matched 
against authors, titles, journals, key words and/or descriptors as stored on the 
tapes mentioned above. Each subscriber receives weekly or biweekly, depending 
on the tape services which he uses, a computer print -out of references to papers 
covering his specific fields of interest. The print-out is on sheets perforated to 
produce cards the size of IBM key-punch cards. Duplicate prints containing ques- 
tions regarding the relevancy of the citations facilitate feedback from users to the 
NSL. A sample of a typical print -out with a key to the format is shown in Figure 1. 

Solution of the second problem, the development of interest profiles for users 
remote from Ottawa, while presenting fewer technical difficulties, has proved 
equally costly and time consuming. 

Training of Search Editors and Users 

Again, during the experimental period, it soon became evident that while the 
searching of data bases could be centralized, the creation of interest profiles 
must be decentralized. Our present experience indicates that at least ten data 
bases, providing an SDI service to 10,000 users, can be readily processed 
centrally using appropriate computer facilities. On the other hand, it is equally 
evident that effective interest profiles are more difficult to construct via telephone 
conversations or correspondence, and that it is essential to arrange for face-to- 
face conversations between the subscriber and someone fully trained in the art of 
profile construction. This is indeed a major problem in a country where distances 
of up to 3000 miles can separate potential subscribers from the focal point of this 
SDI service. The problem Was solved in two ways. 

Seminars were established to train Search Editors in the art of profile construc- 
tion^. These two-day seminars are presented at the NSL in Ottawa, and initially ... 
were held every two weeks. Now that the preliminary rush has subsided, the 
seminars are held as required, usually once a month. To facilitate training, 
attendance is usually limited to ten participants. During this period the pupils 
are thoroughly acquainted with the main features of the SDI program and are 
given practice in the actual techniques of profile construction. 

Search Editors are responsible for assisting SDI users in their respective or- 
ganizations and constitute a direct link between the NSL and subscribers in all 
parts of Canada. They are also responsible for submitting properly constructed 
profiles to the NSL, for updating and sharpening profiles once they arc operational, 
and for scanning and distributing the printed bibliographies to the users they have 
assisted. 
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To supplement the training sessions held in Ottawa, the NSL conducts seminars 
in other parts of Canada where there is a concentration of potential subscribers, 
as for example at large universities and industrial parks. 

As of April 1971, 250 Search Editors have been trained. Of this number 30% 
came from federal and provincial government agencies, 40% from universities, 

20% from industry and 10% from hospitals, public libraries, provincial research 
councils, etc. 

The training of Search Editors and of users has also been facilitated through the 
preparation of a "Profile Design Manual'^which is distributed without charge to 
Search Editors and SDI users. The Manual, now in its second edition, is period- 
ically reworked and updated to incorporate refinements in the system and to 
clarify instructions. Some research workers have found the Manual sufficient to 
enable them to construct effective profiles without the intermediary of a Search 
Editor* The majority of subscribers, however, prefer to work directly through 
a trained Search Editor. 

Cos ts and Subscription Fees 

Now a few words about costs. Until recently subscribers to the CAN/SDI service 
were charged $100 per year for an interest profile of up to 60 terms and the use 
of one data base. An extra $15 per year was charged for each additional data 
base used. Two major factors were considered in establishing this fee. It was 
designed to recover the major costs of providing the SDI service, and to produce 
a fee which would discourage the diletante, but not deter the serious research 
worker. The figure, while initially little more than an educated guess, accom- 
plished its purpose for within two months after announcement of the service, 210 
subscribers had signed contracts, and this for an experimental service which, as 
far as users were concerned, had not proved its worth. 

After two years of operation we are now serving approximately 800 subscribers 
requiring a total of more than 3500 individual computer queries, and resulting in 
the preparation each week of over 1000 personalized bibliographies. Because of 
group profiles and profiles containing multiple questions, the program actually 
serves 1400 individuals representing 7000 questions or subject interests. During 
the past year 52,000 tailor-made bibliographies were supplied to SDI subscribers. 

This growth in the use of the CAN/SDI service has come about primarily through 
word of mouth between satisfied subscribers. Except for brief announcements in 
NSL Newsletters and articles in national journals, no attempt has been made at 
large scale publicity measures. Staff shortages and inadequate budgets made it 
impossible to cope with a faster rate of growth in the number of subscribers. 
However, NSL staffs and budgets have been increased in keeping with the wider 
responsibilities now assigned to the NSL, and an extensive publicity campaign is 
being prepared. With such measures it is anticipated that subscriptions will 
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double within the next 12-18 months. 

The CAN/SDI program is* carried out at the NSL by a staff of eight - a director, 
assistant director, one systems analyst, two search editors, two clerical assist- 
ants and one key-punch operator. This number does not include staff support 
provided by the NRC Computation Centre. 

After two years of service the CAN/SDI program has proved to be a valuable 
information retrieval service. During this time the NSL has also had an opportu- 
nity to determine the actual costs of providing this service. Based on this new 
information and effective April 1, 1971, a new pricing schedule was implemented 
to reflect actual use, computer time, staff salaries, leasing of tapes and mailing 
charges. The annual subscription concept has been replaced by a system of 
charging per tape searched. A minimum fee of $40, which provides for a profile 
of 60 terms is now being charged. At the end of each 12 month period the 
subscriber is invoiced for an additional amount based on his use of the data bases 
available. Under this system, and with a profile not exceeding 60 terms, users 
of ISI Source tapes and CA Condensates will be charged $117 per year. Users of 
Chemical Titles, INSFEC, Biological Abstracts and MEDLARS will be charged 
$45, $78, $81 and $60 respectively. The new schedule will thus result in 
increased costs for some subscribers but for others there will be a considerable 
reduction. 

Feedback from users, via the duplicate citation print-out, has clearly indicated 
the usefulness of the CAN/SDI service and the general satisfaction of subscribers. 
The value of the service was confirmed when, after one year of operation, 80% of 
the original users renewed their subscriptions. In no case was there a cancella- 
tion because of dissatisfaction with the service. Cancellations were due primarily 
to changes in occupation or responsibility, lack of the necessary fee, or unsuit- 
ability of the data base. 

User Evaluation of CAN/SDI 

Because of the experimental nature of the program, the high investment in tape 
services, staff and computer time, and in spite of favourable response, it was 
decided that a wider sampling of user reaction was required in order to plan 
future developments. In June 1970 a questionnaire was mailed to 604 users of 
which 406 replied. The questionnaire sought expressions of opinions regarding 
completeness of subject coverage, additional tape services, deficiencies in the 
system, multiple use of computer produced bibliographies, and usefulness of the 
service®. 

Coverage of chemistry, physics, electrotechnology and mathematics was judged 
good, but there was a strong demand for better coverage of the biological 
sciences. The NSL responded by adding Biological Abstracts and BA Previews 
to its list of data bases. 
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There was an overwhelming demand for a retrospective searching service, which 
up to this time is available only through the MEDLARS tapes. Studies are now 
under way to determine whether the CAN/SDI service, which is essentially a 
current -awareness service, should be expanded to allow for retrospective 
searches, or whether a complementary national service to deal with back-file 
queries should be established. 

A major criticism of the service centred on the delays experienced in acquiring 
hard-copies of cited papers. This comment was particularly disconcerting since 
the NSL, in initiating the SDI service, had guaranteed that all source literature 
covered by the various data bases would be available either in its own collections 
or in those of local libraries and information centres. Clearly there is a contin- 
uing need to strengthen the resources of local libraries. Furthermore the 
communications processes whereby information is transferred from the NSL to 
the ultimate user, must be improved. With the establishment within the NRC of 
an Advisory Board for Scientific and Technological Information and increased 
support of NSL resources and services, the prospect of correcting these defi- 
ciencies is most encouraging. 

The questionnaire raised the perennial problem of relevance. Some users of the 
CAN/SDI service are satisfied with a 10 - 15% relevancy, while others want and 
claim to have achieved 95% relevancy. In any case the comments indicated that 
only the user can and should decide as to what is relevant, marginally relevant, 
or irrelevant. The comments also underlined the importance of developing a 
satisfactory procedure for preparing individual profiles and for continual adjust- 
ment through feedback. 

The question regarding usefulness of the service produced some startling informa- 
tion - namely that those users who responded to this question had achieved a 
combined saving of 39,000 man-hours per year gathering interest related informa- 
tion through use of the CAN/SDI service. A surprising number of users also 
indicated that, because of their use of CAN/SDI, they had given up conventional 
search procedures. 

The survey confirmed that the CAN/SDI is an effective current-awareness service 
and that it is becoming an important element in the Canadian national system for 
the dissemination of STI. It has achieved considerable success in reducing the 
time spent by researchers in monitoring the current literature while at the same 
time providing increasingly reliable and extensive coverage for industry, univer- 
sity and government. 

Plans for the Future 

The ever expanding need for information by science, government and industry, 
and the rapid improvement in techniques for processing information, makes it 
extremely difficult and perhaps unwise to make plans for future action, unless 
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the plans can be readily changed. In these matters the NSL's crystal ball is no 
clearer than ones used by other countries which are also struggling to improve 
the dissemination of information. 

With these reservations in mind, and speaking only of the CAN/SDI Project, the 
NSL does plan to extend and improve its SDI services along the following lines: 

1. Extensive publicity to acquaint more Canadian scientists, technologists, 
managers and others with the CAN/SDI Project. 

2. Expansion of the user training program. 

3. Utilization of new data bases to obtain wider coverage of scientific and tech- 
nical literature. 

4. Development of retrospective searching programs. 

Simultaneously, and in keeping with the government's directive to develop in 
concert with existing information organizations a national ST1 system, there will 
be a continuing study of the project to determine: 

1. The degree to /hich central processing of tapes can or should be expanded. 

2. Those activities of the CAN/SDI Project which could be performed more 
efficiently by private agencies, e. g. tape searches and distribution of print- 
out. 

3. The desirability of assigning to other agencies responsibility for providing 
within CAN/SDI, an SDI service covering subjects in which they have special 
competence. 

4. The cost-effectiveness of the service. 

Conclusion 

This has been a rather hasty and incomplete review of one country's rather 
halting steps to provide a national SDI service through the adoption of an informa- 
tion handling technique which was developed as early as 1958. Any claims to 
originality and uniqueness rest primarily on the designing of a program which 
effectively overcomes the problem created by the incompatibility of data bases, 
and the utilization of both centralized and decentralized procedures to provide a 
current-awareness service to a relatively large population in one of the world's 
largest countries. Whatever success the program has achieved is due entirely 
to the ingenuity and efforts of the staffs of the NSL and of the NRC's Computation 
Centre, and to the cooperation and enthusiasm of those Canadian scientists who 
served as guinea pigs in the early stages of experimentation. 
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THE COSTS OF DOCUMENTATION SERVICES BASED ON MAGNETIC TAPES 
Basis for the Choice of Price Policy 

Scandinavian Conference in Copenhagen, 28th - 29th October 1970 

Reported by: Helge J. Skov 

Documentation Department 

National Technological Library of Denmark 

SYNOPSIS 

The actual costs of running computer-based SDI services 
can be split up in 4 main components: Disbursement to 
the supplier of the tape system, working expenses of the 
documentation center, costs of the electronic data pro- 
cessing, and expenses connected with the procurement of 
the original material to the subscribers. The costs per 
profile for running 50 or 300 profiles in the Chemical 
Abstracts Condensates system are analysed, and distribu- 
ted on the four components mentioned. 



1. Background, Introduction 

The Nordforsk committee of technical information, which is the expert group of 
Nordforsk in the field of Documentation and information, has for several years been 
concerned with the problem of cooperation between Denmark, Finland, Norway and 
Sweden in connection with computerized documentation services. As part of this co- 
operation a number of conferences and some smaller meetings have been held in order 
to discuss the problems of computer based documentation. Issues of current interest 
which have been discussed at these conferences, are the establishment of Scandinavian 
agreements regarding the procurement of documents, the use of tape services and re- 
lated activities, the training of staff, and a common price policy. In this connec- 
tion the need for a concrete numerical basis for the discussions has arisen. 

The committee of technical information has therefore decided to arrange a conference, 
which through lectures, discussions and meetings of working groups should give a 
realistic picture of how much it actually costs to run a tape service. Furthermore 
the conference should serve the purpose of creating an analytical instrument for the 
policy-forming institutions of information and documentation in particular countries 
and form the basis for decisions related to the running of local or other tape ser- 
vices. The conference was planned and arranged by Mr. H.J. Skov, National Techno- 
logical Library of Denmark. 
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2. The Programme, Participation 

The programme and list of participants appears from appendix No. 1. 

The conference was divided into three sessions: Approach to the problem, groupwork, 
and conclusions. 

Altogether 22 persons took part in the conference, namely 5 from Denmark, 4 from 
Finland, 5 from Norway, 5 from Sweden and 3 from Nordforsk. Participation in the 
conference was based on an invitation of the Nordforsk committee of technical infor- 
mation after consultation, with the central institutions of scientific and technical 
information and documentation of each individual country. 

3. Working Groups 

As a concrete topic for the discussions of the conference a system as for instance 
Chemical Abstracts Condensates (CAC) with 50 or alternately 300 annual search profiles 
was chosen for elucidation from an economic point of view. It was assumed that the 
total costs could be divided into 4 main components: Disbursement to the supplier of 
the tape system, working expenses of the documentation center, costs of the electronic 
data processing, and expenses connected with the procurement of the original material 
to the subscribers* 

After an introduction by Mr, Th* Franck and Mr. H.J. Skov short lectures of 20 minutes 
each were given by Mr. Anders Kallner, Mrs* Ch. Lindqvist, Mr, Peter Svare, and 
Mr. J. Brandrud as introduction to the group work. 

4 working groups were established in accordance with the competence and wishes of each 
participant. Each of the 4 groups was given the task of elucidating one of the 4 
mentioned cost components* At the end of the group work each group presented a 
written summary of its work. These summaries are included as appendices No. 2a-2d. 

Plenary Meeting 

The conference finished with a plenary meeting including a discussion of the results 
of tbe working groups and a conclusive debate under the leadership of Mr. Th, Franck. 

A summary of the conclusions appears in appendix No. 3. 

According to the results of the conference and with the reservations stated in the 
appendix it is possible to put forward the following approximate distribution formula 
for the cost components of a system of the type CAC* The cost components are stated 
below in per cent per profile per year for 50 and 300 profiles (one profile comprises 
20 search terms) • 
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S.Kr. 

(Swedish Crowns) % 





Number of profiles: 


50 


300 


50 


300 


I. 


Disbursement to the 
supplier. 


600 


150 


15 


4 


II. 


Working expenses of the 
documentation center 


1000 


1000 


25 


30 


III. 


Costs of the electronic 
data processing. 


1400 


1200 


35 


36 


IV. 


Expenses connected with 
the procurement of 
original material. 


1000 


1000 


25 


30 




Total: 


4000 


3350 


100 


100 



The calculations, which were made in Swedish currency, revealed that the total amount 
for CAC per profile per year is between 3,700 and 4,400 Swedish Crowns for 50 profiles 
and between 3,000 and 3,800 Swedish Crowns for 300 profiles. The variations in che 
estimated costs of the documentation center and electronic data processing is not 
surprising in this very first attempt to make a cost estimate of components of docu- 
mentation services. Later on comparative calculations may give less varied figures. 

Approved 30th January 1971 
Vibeke Aromundsen 

President of the Committee of Technical Information, Nordforsk. 
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Denmark: 



Finland : 



Norway: 



Available Services 

Appendix No. 1 
Participants 

Mrs. Inge Berg Hansen, M.Sc. 

National Technological Library of Denmark 
0ster Voldgade 10 
DK-1350 Copenhagen K. 

Mr. Denis Dion, Chief adviser 
I/S Datacentralen 
Ved Stadsgraven 15 
DK-2300 Copenhagen S 

Mr. Holger Friis, Deputy Librarian 
The University Library, Division II. 

N0rre Alle 49 
DK-2200 Copenhagen N 

Mr. Helge Skov, M.Sc. 

National Technological Library of Denmark 
Documentation Department 
0ster Voldgade 10 
DK-1350 Copenhagen K 

Mr. Peter Svare, M.Sc. 

I/S Datacentralen 
Ved Stadsgraven 15 
DK-2300 Copenhagen S 

Mrs. Birgitta Holm, M.Sc. 

Centrallaboratorium AB 

Box 101 36 
Helsingfors 10 

Mr. Sauli Laitinen, M.Sc. 

Helsinki Technological University Library 
OtnUs 

Mrs. Christina Lindqvist, M.Sc. 

Finska Socker AB 
Harju IB 12 Kantvik 

Mr. Ilkka Roman, M.Sc. 

Helsinki Technological University Library 
Otnas 

Mr. J. Brandrud, Deputy Librarian 
The National and University Library in Oslo 
Dramm^nsveien 42 
Oslo 2 
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Sweden ; 



t. 



Nordf orsk: 



Mr. 0. Brisner, Administrative secretary 
Norwegian Industries Development Association 
Forskningsveien 1 
Oslo 3 

Mr. H.M. Fagerli, Technical adviser 
The National and University Library in Oslo 
Drammensveien 42 
Oslo 2 

Mr. W. Holst, Manager 

Norwegian Industries Development Association 
Forskningsveien 1 
Oslo 3 

Mr. K. Thalberg, Library director 

Library of the Technical University of Norway 

Trondheim 



Mr. Hans Baude, Library director 
The University Library in Linkoping 
Fack 

581 83 LinkUp ing 1 

Mrs. M. Edstrom, Librarian 

Royal Institute of Technology Library 

100 44 Stockholm 70 

Mr. A. Kallner, Dr. t University lecturer 

Royal Caroline Institute 

Fack 

104 01 Stockholm 60 

Mr. Wolfram Uhlmann, M.Sc. 

Forsvarets Forskningsanstalt 
F0A Index 

104 50 Stockholm 80 

Mr. Kjell Olmestrand, Head of terminal 
Inst, for ekonomi och statistik 
Datasektionen 
750 07 Uppsala 7 

Mr. Th. Franck, Managing director (Member of the 
board of directors of Nordforsk) 

Danish Council for Scientific and Industrial Research 
H^rsholm Kongevej 2 
DK-2970 Hrfrsholm 
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Mrs* Vibeke Amraundsen, Library director (President of 
the Committee of Technical Information) 

National Technological Library of Denmark 
0ster Voldgade 10 
DK-1350 Copenhagen K 

Mr. Erling Hagen, M.Sc. 

Nordforsk 

Main secretariat of Nordforsk 
Box 5103 

102 43 Stockholm 5 
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Appendix 2a . Record from Working Group I: 

Disbursement to the Supplier of the Tape System 



Participants: 



Denis Dion 
Helge J. Skov 
Birgitta Holm 
K. Thalberg 
A. Kallner 



Task 

Construct and analyse a model of the disbursement to the supplier of the tape system 
taking into consideration the subscription and licence fee for e.g. references printed 
out (hit) both for current awareness services and for retrospective searches. Attempt 
also to estimate how much a supplier of a data base could be expected to demand as 
compensation for allowing storage of the data base on e.g. a disc to which many docu- 
mentation centers or users might have direct access. 



Account 

In the following account we will confine ourselves to the conditions which affect 
Chemical Abstracts Service (CAS) and the data bases made by CAS, firstly Chemical 
Abstracts Condensates (CAC) . The reason for this is the fact that CAS has a relatively 
long experience with tape systems in this connection and that their systems are re- 
latively comprehensive. Furthermore the policy of CAS has been of guidance to many 
other documentation systems as for instance BIOSIS, COMPENDEX, ABIPC, etc. 

CAS has asserted that in the future strict agreements should be made on the basis of 
a fixed base price and a certain royalty according to which CAS will receive an income 
in relation to the use of the system. The working group believes that the absolute 
level of the base price and the royalty cannot be discussed objectively in this con- 
nection, but is of the opinion that the present level is reasonable. 

According to the working group the cost component of a profile with which the sub- 
scription contributes is as follows: Base price $ 4,400, freight, magnetic tapes 
etc., $ 1,000. A profile gives on average 26 references per search, which corresponds 
to a price per profile of approximately $ 120 and $ 30 at a load of 50 and 300 pro- 
files, respectively. The above calculation does not take into consideration the pos- 
sible use of the data base in connection with retrospective systems. 

The group is also of the opinion that a leasing agreement with CAS as the formal 
owner of the tapes is acceptable. 

The main objection to the proposed contracts concerns the restrictions according to 
the wishes of CAS regarding the use of the data bases. According to CAS every form 
of reproduction and issuing of the tape contents should be explicitly forbidden. CAS 
assumes that the customers of the documentation centers will accept this condition. 

As far as we can see, this will, to a great extent, prevent a rational exploitation 
of computerized documentation services without clearly reducing the possibilities 
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for companies to sell their printed publications and magnetic tapes* We are of the 
opinion that these two documentation systems supplement each other and will not com- 
pete when within a few years a less dramatical situation can be foreseen. 

Consequently the working group suggests that a future discussion concerning the for- 
mulation of the contract should aim at the abolition of these limitations. We suggest 
that three main lines of action be tested; these are listed below without any relevant 
priority: 

1. Another form of financial agreement should be pursued. It is obvious that the 
Scandinavian countries in common or separately could take part in delivering input to 
the system. This could be done for instance by the Scandinavian countries undertaking 
(like U.K.) the editing and drawing up of abstracts and indexing of articles which are 
published in the Scandinavian technical journals. 

2. Negotiations should be started with the purpose of establishing a price at which 
the producer (CAS) is willing to sell all material without the present limitations 
of use. In this connection several specific partial solutions could be considered, 
which allow e.g. the building up of retrospective files, direct access, the printing 
out of text on the tapes, etc. 

3. An adequate indexing of scientific literature is not only of international interest, 
but also to a high degree an international responsibility. This should not be super- 
vised by one single national institute or organization, instead one should aim at 
attaining a “multinational indexing-institute” where different nations for instance 

the Scandinavian countries, in common or separately, commit themselves to grant a 
certain financial support to the indexing institute without a direct relationship to 
the use of the institute. An estimation of this financial support could serve as a 
feature in the discussion of item 2 above. 

28.10.1970 
A. Ka liner 
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Appendix 2b . Record from Working Group II: 

Working Expenses of the Documentation Center 

Participants: Holger Friis 

Sauli Laitinen 
Christina Lindqvist 
W. Holst 
Malin EdstrHra 



Task 

Construct and analyse a model of the working expenses of the documentation center* 
Which and how big a staff is necessary and how much will it cost? Appraise the 
office rent, travel expenses, instruction and teaching work in connection with a 
modern and progressive center* Estimate also the costs Involved in the current 
training of the staff required* 



Account 

Since none of the members of the group have experiences from Chemical Abstracts we 
had to base our conclusions on the experience of KTH (Royal Institute of Technology 
Library) in Stockholm with its 12 tape systems and THB (Helsinki Technological 
University Library) with PANDEX as well as Danish experience with MEDLARS* 

Regarding the alternatives, 50 and 300 profiles, the group found that a total of 
50 profiles would not be sufficient to form the basis for a documentation service 
based on tapes except for an introductory period. 

The group has summed up the staff requirements as follows: 

For the profile formulation: 2-4 hours/profile* 

In order to raise the number of profiles to 300 when starting from scratch 1/2 
manyear is required. 

In order to maintain these profiles in connection with cancellations, new subscrip- 
tions and changes, it Is necessary to have: 

minimum: 1 information officer 

maximum: 2 information officers 

with the supposition that 50% of the time is used for work with the profiles while 
the other 50% is spent in keeping contact with clients including time for training. 

In order to follow up this work by punching profiles, copying and sending out print- 
outs each week, as well as invoicing and procuring articles for copying it is neces- 
sary to employ: 

minimum: 1 secretary 

maximum: 2 secretaries 
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The Necessary sales work sums up to 1 many ear, however, the group has Included this 
work In an overhead of 100%. 

Once-for-all procurement of 

brochures and leaflets D. kr. 10.000,00 

Annual demand for brochures D. kr. 5.000,00 

The salary level Is clearly different in the four Scandinavian countries. The group 
has agreed upon the following salary base (D.kr. » Danish Crowns): 



Documental! st 


(3 years experience) 


70.000,00 


Secretary 




35.000,00 


Sales staff 




70.000,00 


The staff costs amount to 


(D.kr.) : 




The working up 


of 1/2 manyear 


35.000,00 


300 profiles 


Maintenance of 


minimum 1 - 


70.000,00 


300 profiles 


maximum 2 - 


140.000,00 


Assistance 


minimum 1 secretary 


35.000,00 




maximum 2 secretaries 


70.000,00 


Instructive 

material, 


brochures, etc 


• 


5.000,00 


Total: 


Minimum (D.kr.) 


Maximum (D.kr.) 


The building up 


35.000,00 


35.000,00 


Maintenance 


70.000,00 


140.000,00 


Secretary 


35.000,00 


70.000,00 




140. 000,00 


245.000,00 


Overhead 100% 


140.000,00 


245.000,00 




280.000,00 


490.000,00 


Brochures 


5.000,00 


5.000,00 


Travelling 


10.000,00 


10.000,00 




295.000,00 


505.000,00 



With 300 profiles this gives a cost per profile of: 

minimum; D.kr. 1,000 (S.kr. 750 with a rate of exchange of 0,75) 
maximum: D.kr. 1,700 (S.kr. 1,250 with a rate of exchange of 0,75) 

28.10.1970 
W. Holst 
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Appendix 2c * Record from Working Group III: 

Costs of the Electronic Data Processing 

Participants: Peter Svare 

Ilkka Roman 
0. Brisner 
H,M« Fagerli 
W, Uhlmann 
K, Chimes trand 



Task 

Construct and analyse a model of the costs related to the operation of hardware and 
the development of software as for instance search programs. Furthermore evaluate 
from an economic point of view whether print-outs of references with long abstracts 
are more profitable than print-outs which only contain author, title, keywords, and 
journal reference* 

Hake also an appraisal of the advantages and drawbacks of having a computer center 
of its own in comparison to running the tapes on a bigger center on time sharing 
basis* 



Account 



Costs of development 
Reformating programs: 

A general Information retrieval system requires one reformating program for each tape 
type. The purpose of the reformating program is to edit data from the original tape 
to the format of the search program. 

Planning, programming and testing of a reformating program costs about 5.000 - 
10.000 S.kr. (Swedish Crowns), 

Other information systems: 

The other information systems consist of programs for reading in, searching, sorting 
and printing out. The costs for the development of such a system are highly dependent 
on how complicated a system is required. As an indication of the costs we have chosen 
the figures corresponding to CORSAIR and TELETEXT. 

The development of CORSAIR has to date cost 1/2 - 3/4 million Swedish Crowns. 

The development of TELETEXT has to date cost approximately 1/2 million Swedish Crowns. 

The present annual development costs are for CORSAIR 100-200,000 S.kr. and for 
TELETEXT/BATCH approximately 40,000 S.kr. 
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The Working Expenses 

The working expenses cover the use of hardware and operator assistance during the 
runs. However, the preparation of search runs is not included since it is assumed 
that the preparation is done at the documentation center. 

Costs connected with the reformatingi 

The magnitude is 100 S.kr. for 1,000 references of 1,000 characters each. 

Costs of searching and printing out: 

The scanning of a tape with 5,000 references and printing out of 100 lines per profile 
costs about 20 - 25 S#kr. per profile. 

The following formula can be used for the calculation of the cost magnitude in con- 
nection with other numbers of search terms: 

Costs: (120 + 0,03 • T + 0,00013 • S • T + 0,1 • L) • 0,83 S.kr. 

T « number of references searched 
S » number of search terms 
L = number of lines printed out. 

Compared with the other costs of the data processing the number of lines printed out 
is of less importance and should therefore be determined from a documentation point 
of view. 

Regarding the question whether a documentation center should have its own computer 
or not, one can state that generally there are considerable advantages connected 
with the running of big computers. 



28.10.1970 
Peter Svare 



Note : 

The total cost is: 

for 50 profiles (CAC) : 1,300 - 1,550 S.kr. yearly per profile, 
for 300 profiles (CAC): 1,100 - 1,350 S.kr. yearly per profile. 

E. Hagen 



640 




Choice of Price Policy 



Appendix 2d . Record from Working Group IV: 

Expenses Connected with the Procurement of the Original 
Material to the Subscribers 



Participants: Vibeke Ammundsen 

Inge Berg Hansen 
J. Brandrud 
H. Baude 



Task 

Construct and analyse a model of the costs connected with the procurement of copies 
of journal articles, patents, etc., often from libraries far away. In this connec- 
tion one should also take into account the consequences of more rigorous copyright 
conditions according to the trends in the big countries, since efficient SDI-services 
will reduce the industrial users' requirements for having their own subscriptions 
to periodicals. 



Account 



Fundamental considerations 

1. Free copying for subscribers is considered unrealistic* 

2. Payment is required only of the actual costs excluding labour costs. 

3. The documentation center should have a reasonable coverage in its own collection 
of literature in relation to the service it offers. For reasons of time and 
economy one should aim at Scandinavian coverage of the relevant journal literature 
to the greatest extent possible; this implies the existence in one form or 
another of a certain survey of the stock of periodicals at the scientific libra- 
ries in Scandinavia. 

The question of the translation services should be evaluated separately. 

4. Each particular Scandinavian country should as far as possible exploit its own 
resources, but in such a way that specialized libraries should still be able to 
establish direct contact across the borders* 

Cost calculations 

The expenses connected with the procurement of original material is difficult to 
calculate exactly and will vary from subject to subject. The expenses are also de- 
pendent on how the material is procured. Time consumed on the ordering side could 
on an average be estimated at 1/2 hour per order. The calculations are based on the 
technical or scientific articles having an average length of 8 pages according to 
the National Technological Library of Denmark. Furthermore (for practical reasons) 
the calculations are based on the prices of National Lending Library* The expenses 
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connected with the procurement of a copy of an article can then be characterized as 
follows: 



Labour costs: 1/2 hour at 30 N.kr. 

Payment for copies 

Various expenses (postage etc.) 



15 N.kr. (Norwegian Crowns) 
5 N.kr. 

2 N.kr 



Minimum cost per copy of an article: 



22 N.kr. 



It is also possible to calculate the costs per profile. We lack accessible data re- 
garding Chemical Abstracts Condensates, but the experience of the National Technolo- 
gical Library of Denmark from their work with Chemical Titles shows that on average 
180 copies (pages) per profile (1 profile « 20 search terms) per year are ordered. 

This corresponds to approx. 22 articles of 8 pages each. Accordingly a profile will 
cost approximately 500 N.kr. per year. The reservation is made that many of the 
users have access to big collections of their own. If all orders for copies were 
handled by the documentation center one would have to allow for an addition of 25%, 
which would give a price per profile of approximately 625 N.kr. per year. 

If the calculations were transferred to CAC then at least a doubling, that is at 
least 1,250 N.kr. per profile, can be expected. 

If one also takes into account a possible future copyright fee of e.g. 15% (of the 
copy costs), the price per profile for CT amounts to approximately 660 N.kr. per 
year. 

On the assumption that the expenses of labour should be covered by the community 
the customer would pay 140 N.kr. of an annual cost of 625 N.kr. per profile, while 
the documentation center pays the rest (485 N.kr). 

In the above calculations it has not been found relevant to include the costs of the 
stock of periodicals. 

Copyright problems 

More rigorous copyright conditions would have a restrictive influence on the utili- 
zation of research results. It is desirable to reach decisions which do not stimulate 
violations of the rules by the restrictions they impose. The establishment of an 
obligation to pay the publisher/author a fee for copying is recosmended by analogy 
with the arrangement existing for musical works. This would give the periodicals a 
compensation for sales losses and probably also prevent misuse. 

The group has not found it possible to take up a position as regards aerioua copyright 
restrictions hindering or delaying copying for research purposes. 

28.10.1970 
J* Brandrud 



642 



Ch'J-' at: of Vvice lolicii 



Appendix 3 

Summary of the Records from the 
Working Groups* Conclusion 



The results of the working groups are stated below as costs in S.kr, (Swedish Crowns) 
per profile per year for a tape system like CAC with a load of 50 alternatively 300 
profiles (one profile comprises 20 search terms). 

The comments from the plenary meeting and the total costs are mentioned below. 



CAC costs per profile per year 





Number of profiles: 


50 


300 


I. 


Disbursement to the 
supplier. 


600 


150 


II. 


Working expenses of the 
documentation center. 


750 - 1250 


750 - 1250 


III. 


Costs of the electronic 
data processing. 


1300 - 1550 


1100 - 1350 


IV. 


Expenses connected with 
the procurement of 
original material. 


1000 


1000 




Total: 


3650 - 4400 


3000 - 3750 



Re. It 

The utilization of the database in connection with a retrospective system has not 
been taken into consideration in the calculations. 

Re. Ill 

The calculations have only been carried out for 300 profiles, since 50 profiles is 
considered too small a quantity to constitute a proper economic foundation for a 
tape-based service of one's own. In order to obtain a realistic standard of refer- 
ence the cost interval corresponding to 300 profiles is also used in the case of the 
50 profiles. 

Re* Hit 

The calculations are presented as costs evaluated in the case of starting a new es- 
tablishment* However | if leased computer time is to be used on a commercial basis 
then the prices will vary within relatively wide limits; accordingly a certain margin 
will probably be applicable to this cost component* 

On the basis of accounts for the period 1968-70 an average annual coat per profile 
for CAC of 803 S«kr*i corresponding to a load of 276 profiles (of 25 search terms 
each) has been indicated by the Swedish participants* The market price for computer 
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time was in this case 755 S.kr. per hour. 

Re. IV : 

In this case the costs per profile are dependent on the number of profiles. The 
numbers given are based on Chemical Titles # while CAC is expected to imply a doubling 
of the copies ordered. 

The total costs are here rounded off to 1000 S.kr. per profile per year. 

Re. Total: 

The stated sums of the components should not be taken as being based on accurate and 
fully valid calculations but rather as an expression of a magnitude. By stating 
approximate average values for each cost component one should therefore be able to 
simplify the account without introducing new errors of importance. If this is done 
the cost distribution in S.kr. and % becomes: 



CAC costs per profile per year 
S.kr. % 





Number of profiles: 


50 


300 


50 


300 


I. 


Disbursement to the 
supplier 


600 


150 


15 


4 


i ii. 

i’ 


Working expenses of the 
documentation center. 


1000 


1000 


25 


30 


; in. 


Costs of the electronic 
data processing. 


1400 


1200 


35 


36 


! iv. 

f 


Expenses connected with 
the procurement of 
original material. 


1000 


1000 


25 


30 


i 


Total: 


4000 


3350 


100 


100 



f 

i The distribution in X which mainly becomes the same even though the maximum/minimum 

j values for II and III are used shows that the disbursement to the supplier of the 

\ tape system constitutes only a small part of the total costs for a load of 300 

t profiles. The computer costs for the number of profiles included in these calcula- 

| tions amount to approximately 1/3 of the costs. The working expenses of the docu- 

I mentation center and the expenres connected with the procurement of the original 

] material are according to the above calculations! approximately the same 9 and 

j constitute an increasing part of the total costs for an Increasing number of profiles. 

| Without drawing too far-reaching conclusions from this distribution it is reasonable 

to assume that the conditions with regard to other commercially available documentation 
| services based on magnetic tapea do not deviate essentially from the service described 

| above* 

1 Stockholm, November 9, 1970* 

| Krllng Hagen, M«8c* 
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B. Doudnikoff 
President 

Dataflow Systems Inc. 
Betheoda, Maryland, USA 



i 



* 



I 

i 

\ 

i. 

( 

i' 

i 

» 

i 







SYNOPSIS 

To start a comprehensive new library or information center usually is an 
expensive operation. In fact, the expansion of coverage in an existing library or 
information center is also linked with relative high cost. A number of "packages" 
are available to help in these areas. These n packages n come in various forms, 
including magnetic tapes for processing on a computer, microform, hard-copy (paper or 
forms) and combinations of these. 

Introduction 

In an effort to assist librarians and information specialists (and to make 
some money) numerous organizations have assembled "packages." Presented here is a 
selected sample of these packages. Unfortunately, some very good ones are excluded. 
However, if the concept behind these "packages" is of interest to the reader it is 
relatively simple to pursue specifics in a particular discipline area. 

To present these sample "packages" the medium versus the discipline was 
considered. The medium won out. Therefore, if discipline is of key interest, 
selections must be made from each medium. Microform, computer magnetic tapes, and 
hard-copy publications are the media. For the purpose of thiB description the hard- 
copy services are being de-emphasized. 

Microforms 

The miniaturization of information and data onto a variety of microforms has 
been done for many years on a custom basis. As organizations realized the cost 
economies of mass production of microforms a new era of micropublishing has evolved. 

The specific microform (l6mm reel or cartridge/magazine, microfiche, etc.) 
will not be discussed here. If detailed information on the media and the required 
reader and reader/printer is needed, it can be obtained from the firms that sell the 
hardware or from the firms that sell the software (the "packages"). 

Listings of the selected microform packages arc shown as follows: 
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BIOLOGICAL ABSTRACTS ON MICROFILM 

BioSciences Information Service of Biological Abstracts, Philadelphia, 
Pennsylvania. 16mm cartridge, bar code, image count, serial sequence, external book 
indexes (computer produced)* Commercial lease* 125 cartridges* 1926 to present* 
Updated bimonthly* 

Two million abstracts of articles and reports in life science research, $1600 
first year with back files, $600 thereafter for current films, but the subscriber 
must also purchase the current hard-copy index for $800 per year* 

Biology Biostatistics Weapon Effects 

Biochemistry Biophysics Bioengineering 

Behavioral Biology Ecology Botany 

Zoology Forestry Food Technology 

Order from: BioSciences Information Service of Biological Abstracts, 2100 

Arch Street, Philadelphia, Pennsylvania 19103, Attn: Dr* John H. Mason* Telephone 
(215) L09-1100 extension 37 or 38* 

"CARDSET" LIBRARY OF CONGRESS CATALOGING DATA ON MICROFILM 

Information Design, Inc*, Menlo Park, California* 16mm cartridge* Separate 
index, also on l6mm microfilm* Commercial subscription* Over 100 cartridges* 

Current, updated every other week* 

\ Covers all English Language cataloging included in MARC (see magnetic tape 
description). Index stations can be leased separately* $38^0 per year for total 
system* 

Additional information from: Information Design, Inc*, Menlo Park, California 

9^025, Attn: Mr. Brett Butler* Telephone (4l5) 369-2962* 

CHEMICAL ABSTRACTS ON MICROFILM 

Chemical Abstracts Service, Ohio State University, Columbus, Ohio. 16mm 
cartridges, bar code, accession number sequence, external book index* Commercial 
lease* Current, updated 8-10 times per year* 

An English language abstracting and in'dexing service, Chemical Abstracts 
covers chemistry and chemical engineering wordwide* Estimated 1970 volume is 270,000 
abstracts from all sources of chemical literature* Indexes not microfilmed* Chemical 
Abstracts Service is a division of the American Chemical Society* 

Chemical Abstracts 
Chemistry 

Chemical Engineering 

Prices effective in 1971, |2,200 for next twelve months including back issues 
from 1907 to present, |l,550 for succeeding years, to current subscribers of printed 
issues of Chemical Abstracts* 
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Additional information from: Chemical Abstracts Service, Ohio State 

University, Columbus, Ohio 43210. Attn: Marketing Department. Telephone (6l4) 
293-1929. 

CUMULATIVE MICROFILM INDEX TO USGRDR 

Clearinghouse for Federal Scientific and Technical Information, U.S. Depart- 
ment of Commerce, Springfield, Virginia. 16mm reels or cartridges, sequential order. 
Government sale. 20 reels. Reissued in 1969. 

The Clearinghouse has been experimenting with an index on l6mm microfilm which 
cumulates the United States Government Research and Development Reports Indexes from 
July 1964 to July 1969. This index was developed on on experimental basis to use in 
handling general reference searches received by the Clearinghouse. 

The result is a product containing over 150,000 titles that represents the 
first comprehensive computer-generated index to the U.S. Government report literature. 
It allows reference searchers to browse this literature quickly for retrieval pur- 
poses. 



While the file is not polished, its continuity of coverage may make it useful 
to librarians and technical information specialists working in the information field. 
This file is now available to the general public. 

The Index is in two segments - July 1964 - December 1967 (Reels 1-13) and 
January 1968 - June 1969 (Reels 14-20) . 

Technical Reports Chemistry 

Electronics & Electrical Mechanical 

Engineering Engineering 

Social Sciences Mathematical 

Space Technology Sciences 

The Microfilm Index is available in the following format: 

Twenty 100-foot reels at $100 

Twenty 100-foot 3M cartridges at $135 

Twenty 100-foot Recordak cartridges at $135 

Order from: Clearinghouse for Federal Scientific and Technical Information, 

U.S. Department of Commerce, Springfield, Virginia 22151. Inquiries: Telephone 
(703) 321-8560. 

IDEP ( INTERAGENCY DATA EXCHANGE PROGRAM) 

Army/NASA IDEP Office, U.S. Army Missile Command, Redstone Arsenal, Alabama. 
l6mm reel or cartridge, bar code, accession number sequence, optical coincidence and 
computer listing type index. Limited availability to qualified agencies and 
contractors. About 285 cartridges. Updated quarterly. 

Covers aerospace and missile test reports and specifications of components 
and parts as prepared by participants other than original manufacturers. Also covers 
test procedures, reliability history and failure analysis. Contains about 30,000 



Chemical Engineering 
Aeronautics 
Astrophysics 
Physics 
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VIII Cormercially Available Services 

test reports and 15,000 calibration procedures. Uses optical coincidence index, and 
book index by subject. Sponsored by Army, Navy, Air Force, and NASA. Other IDEP 
Offices; Fleet Missile Systems Analysis and Evaluation Group (Code E-63) , Corona, 
California 91720, and Los Angeles Air Force Station, Code HQ SAMSO (SMSDl) , Los 
Angeles, California 90045- Limited to agencies and contractors in aerospace and 
weapon systems area participating in actual testing. 



Components 

Missiles 

Performance Tests 
Mechanical Properties 
Measuring Instruments 



Specifications 

Aircraft 

Mechanical Parts 
Calibration Procedures 



Reliability 

Tests 

Failure 

Test Equipment 



For additional information about IDEP, contact; U.S. Array Missile Command, 
Army/NASA IDEP Office, Redstone Scientific Information Center, AMSMI-RBP, Redstone 
Arsenal, Huntsville, Alabama 55809- Telephone: Area Code (205) 876-8720 

"INFACT" COLLEGE CATALOGS ON MICROFILM 

Dataflow Systems, Inc., Bethesda, Maryland. 35nun roll film. External "peek- 
a-boo" index. Commercial subscription. 20 rolls. Current, updated quarterly. 

Covers about 500 leading U.S. colleges and universities undergraduate catalogs. 
Index, "College Suggestor" also sold separately. Price range from 175 per year 
(regional coverage) to 1495 per year (total selected coverage). 

Additional information from: Dataflow Systems, Inc. 7758 Wisconsin Avenue, 

Bethesda, Maryland 20014. Attn; J.B. Malcom. Telephone (301) 654-9133* 

SHOWCASE MICROFILM LIBRARY FOR CONSTRUCTION (SML-C) 

Showcase Corporation, Detroit, Michigan. 16mm cartridge, sequential frame 
numbers, book and filmed index. Commercial lease. 100 cartridges. Current, updated 
quarterly. 

A complete microfilm library designed specifically for the construction 
industry and design profession, containing manufacturers’ catalogs, selected specifica- 
tions from the U.S. Government (including pertinent Federal Specifications, U.S. 
Department of Commerce Specifications, Federal Housing Specifications and Standards, 
Army Corps of Engineers Guide Specifications, and others) , and relevant information 
from industry associations, institutes, and government agencies. The file is indexed 
by product, keyword, trade name, and manufacturer’s name. 



Civil Engineering 
Construction Equipment 



Vendor Catalogs 
Construction Materials 



Specifications 

Construction 



Order from: Showcase Corporation, Showcase Building, 6230 John R. Street, 
Detroit, Michigan 48202, on annual lease for $2,800. 

UNITERM INDEX CHEMICAL PATENTS ON MICROFILM 

IFI/Plenum Data Corporation, Washington, D.C. 35mm and l6mm reel accession 
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Comercially Available Services 

number sequence. External dual dictionary index and magnetic tape search system. 
Commercial sale. 1959 to current. Updated monthly. 

All U.S. Chemical patents from Official Gazette classification and all 
chemically related patents from other classes. Filmed in IFI accession number 
sequence by years. Available in the following series to complete patents contained 
in the Uniterm Index: 



1959 - 1963 


(58,113 patents) 35mm reels 


*1,850, 


1964-- 1966 


(50,007 patents) 16mm reels 


900, 


1967 


(18,964 patents) 16mm reels 


515 


1968 


(l8,000 patents) 16mm reels 


515 


1969 


(20,000 patents) 16mm reels 


515 


1970 


(20,000 patents) 16mm reels 


515 



Also available : 

Patent Claims contained in the Uniterm Index: 

1950 - 1969 reel or cartridge 16mm $1,350. 

1970 reel or cartridge 16mm 200. 

Minor Terms contained in the Uniterm Index: 

1950 - 1963 (282,000 compounds) 

6 cartridges 16mm $2,200. 

Chemistry 

Chemical Engineering 
Patents 

Order from: IFI/Plenura Data Corporation, 1000 Connecticut Avenue, N.W., 

Washington, D.C. 20036. Attn: H. Alcock Telephone: (202) 296-4936 

U.S. PATENT SUBSCRIPTION SERVICE 

Clearinghouse for Federal Scientific and Technical Information, Code 52.12 
(Subscription Sales), U.S. Department of Commerce, Springfield, Virginia. 16mm reels, 
bar code, patent number sequence, no index provided. Annual subscription. About 250 
reels per year. 1966 to present. Updated as required. 

All U.S. patents are contained in the Offical Gazette, which serves as an index. 
Reels issued as accumulated, about one month after announcement. Back years from 1966 
are available at $895 per £ear. Annual subscription all classes, is $895 per year. 
General and Mechanical category only is $600 per year. Electrical category is $400 
per year and Chemical category only is $300 per year. 

Patents Mechanical Engineering 

Chemistry Electronics & Electrical Engineering 

Chemical Engineering 

Order from: Clearinghouse for Federal Scientific and Technical Information, 

Code 52.12, U.S. Department of Commerce, Springfield, Virginia 22151* Attn: Mr. 

Foster. Telephone: (703) 321-8507 
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VSMF DESIGN ENGINEERING FILE 

Information Handling Services, Division of Indian Head, Inc,, Englewood, 
Colorado. 16mm cartridge, bar code number frames, book and film index. Commercial 
lease. About 220 cartridges. Current, reissued quarterly. 

Catalog data, specifications, drawings, and test reports on parts, materials 
and services used in industry. Indexed and filmed by product groupings. 

Vendor Catalogs Specifications Engineering Drawings 

Vendor Part Numbers Test Reports Mechanical Engineering 

Electronics & Electrical Components Mechanical Parts 

Engineering 

Available on monthly lease for $480 per month, or annually for $5360. 

Order from: Information Handling Services, 5500 South Valentia Way, Englewood, 

Colorado 80110. 

VSMF MILITARY SPECIFICATION FILE 

Information Handling Service, Division of Indian Head, Inc., Englewood, 
Colorado. 16mm cartridge, bar code numbered frames, filmed index. Commercial lease. 
About 205 cartridges. Current, updated monthly. 

Specifications and book standards used by industry, covering 483 Federal 
Supply Classifications, MIL SPECS, QPL's, MIL STDS, JANS, FED SPECS, and FED STDS, in 
six military specification files. Numeric index on film. Also available in 8mra. 
Historical file also available. 

Specifications Federal Specifications Military Specifications 

Military Standards Federal Standards Standards 

Qualified Product Lists 

Available on 16mm on annual lease as follows; 



Order from: Information. Handling Services, 5500 South Valentia Way, Englewood, 

Colorado 80110. 

VSMF MILITARY STANDARDS FILE 

Information Handling Services, Division of Indian Head, Inc., Englewood, 
Colorado. l6mm cartridge, bar code, numbered frames, filmed numeric index. Com- 
mercial lease. 10 cartridges. Current, updated bimonthly. 

Microfilm cartridges of MS, AN, AND, NASA Standard Parts! MIL-D-1000, MIL 
Handbook H4-1, H4-2 and 300 on ground support equipment. 



Assemblies 

Electrical 

Instruments 

Mechanical 

Procedures 

All 



- 8450 

- 1120 

- 450 

- 1065 

- 165 

- 3350 
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Standards 

Military Standards 



Available on l6mm on an annual lea Be for 1560, 

Order from: Information Handling Services, 5500 South Valentia, Englewood, 

Colorado 80110. 

MAGNETIC TAPES (FOR COMPUTERS) 

Instead of disseminating information in microform some organizations have 
chosen the media of magnetic tape. Unlike, microform, magnetic tape information/ 
data must be processed on a computer to make it useable/readable. However, this 
added complexity also is the key asset of magnetic tape services. The information 
is "tailored" to local needs on the local computer. The microfilm is non- flexible, 
but is directly readable with minimal need for machines (outside of the relatively 
simple microform readers and reader/printer). 

A quick review of the better known services is as follows: 

BA PREVIEWS 

BioSciences Information Service of Biological Abstracts, 2100 Arch Street, 
Philadelphia, Pennsylvania. 

Subject Matter: Research literature of the world dealing with life sciences 

in any of their manifestations. 

Types of Source Items Input: Approximately 8000 journals are reviewed for 

input, and an average of 4500 of these are productive during a calendar year. Of 
these journals 69% of them are English language publications-. Journals represent 
84# of the total input. Monographs, published proceedings, theses and selected 
abstracts from the Referativnyi Zhurnal Biologia account for the remainder. 

Subject Analysis/Indexing: Items are assigned an average of 19 index terms or 

descriptors. Words from the title and abstract are used as keywords or descriptors. 
In addition, the title is enriched by adding descriptors and uncontrolled descriptors 
are selected from the text. No thesarus is used in indexing. 

Searchable Data Elements: Keywords, CROSS index (subject headings), 

Biosystematic index, author, Journal source (CODEN). 

Time Span Available: 2 years in BA Previews format, (ll years in machine 

readable form for inhouse searches). 

Frequency of Tape Issue: 5 tapes per month (BA semimonthly, Biol monthly) 

Average Number of Source Items Cited Per Tape: 6400 

Subscription Cost or Leasing Details: 

Tapes are available on lease. $3800 (includes 2 monthly tapes from BA of 
5800 citations each and 1 tape Biol of 7500 citations) lease price on site. Royalty 
for additional customer use is negotiable. Those interested should contact the 
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source. 



Software Availability: None 

Type of Inhouse Service Offered: 

CLASS (Current Literature Alerting ^earch Service), an SDI-type service based 
on each tape as it becomes available* The cost for this service is 1 100/yea r per 
search profile. Retrospective Searches are available. The cost for this service is 
|150 per search (with the right reserved to refuse a search at this price if it 
presents extraordinary problems)* 

Publications Produced from Base by Originator: 

B.A.S.I.C* , BioResearch Index , Annual emulative Indexes to Biological 
Abstracts , BioResearch Index , Abstracts of Mycology , Abstracts of Entomology * 

Additional Information from: BioSciences Information Service of Biological 

Abstracts, 2100 Arch Street, Philadelphia, Pennsylvania, 19103, Attn: 

Mrs. Phyllis V* Parkins. Telephone (215) 569-1100. 

PANDEX - CURRENT INDEX TO SCIENTIFIC & TECHNICAL LITERATURE 

CCM Information Corporation, New York, New York 

Subject Matter: 2100 scientific, technical and medical journals. 6000 

scientific technical books* 50,000 U*S. Government technical reports. 

Types of Source Items Input: 2100 journals have all articles contained in 

them entered into the data base. Of these journals 70# are English language 
publications; 30# non-English. Journal articles account for 80# of the data base; 

18# of the input is taken from government reports; 2# are monographs. 

Subject Analysis/Indexing: Each item has an average of 6 terms assigned to 

it. Terms are selected from a controlled thesarus. In addition to the indexing, 
titles entered into the record are "enriched" by adding descriptive words to the 
published title. 

Time Span Available: 1966 

Frequency of Tape Issue: Weekly 

Average Number of Source Items Cited Per Tape: 5000 

Subscription Cost or Leasing Details: 

Educational, industrial, government or quasi-government organizations using 
the tape for inhouse use: 16500* The soma price applies to non-profit regional 

information centers providing profile searches to customers at stated rates. Multi- 
user groups providing SDI service from central computer group to a group of companies 
in industry or a group of universities should contact CCM for price details. Profit 
information centers providing profile searches should also contact CCM. 

Software Availability: COBOL programs print-out and SDI. IBM 360/DOS. 32K 

core. 7 or 9 track 800 BPI. Coding is EBCDIC, BCDIC, or ASCII. Fixed field or 
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KARC II format. 

Publications Provided from Base by Originator: 

Pandex — Current Index to Scientific & Technical Literature. Bi-weekly 
publication. Free hard copy is supplied to tape subscribers* 

Additional Information from: CCM Information Corporation, 909 Third Avenue, 

New York, New York 10022, Attn: Richard P. Kollin. Telephone (212) 935-308?. 

*CA-COW)ENSATES 

Chemical Abstracts Service, The Ohio State University, Columbus, Ohio. 

Subject Matter: Bibliographic citations plus Keyword Index phrases from 

Chemical Abstracts issues. 

Types of Source Items Input: The tape is to provide searchable data items 

from the corresponding issue of Chemical Abstracts . , There is an accompanying printed 
version of the corresponding Chemical Abstracts issue. 

Subject Analysis/Indexing: Titles of articles and patents are enriched by 

adding descriptors. Words from the title and abstracts are used as descriptors. 
Uncontrolled descriptors or keywords from text are selected and added to the record. 

Searchable Data Elements: Words in title, keyword phrases, author name(s), 

and journal CODIU. 

Abstracts: None. However, the record does include shorter phrases which may 

be used in lieu of abstracts. 

Time Span Available: Available from July, 1968. 

Frequency of Tape Issue: Weekly. 

Average Number of Source Items Cited per Tape: 5000 (250,000 new articles 

and patents each year). 

Subscription Cost or Leasing Details: 

Tapes are available on lease-subscription at 14400/year plus costs of tapes 
and mailing. Interested subscribers should contact CAS as to type of lease/license 
for which they qualify. 

Software Availability: Contact the source regarding this item. 

Type of Inhouse Service Offered: Search of file based on customers search 

profile. Cost: 14400 plus charge for computer time used. 

Note: Further details describing services are included in Information Services 1971 . 

Available upon request from Chemical Abstract Services. 

Additional Information from: Chemical Abstracts Service, The Ohio State 

University, Columbus, Ohio 43210, Attn: Marketing Department, Telephone 
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(6l4) 422-1929. 

U.S. GOVERNMENT RESEARCH & DEVELOPMENT REPORTS 

National Technical Information Service, Springfield, Virginia 

Subject Matter: Broad coverage of all sciences, technical subjects and 

social sciences* 

Types of Source Items Input: Many of the documents announced in USQRDR 

are received from the Department of Defense (DOD), the National Aeronautics and 
Space Administration (NASA) and the Atomic Energy Commission (AEC). Contributions 
from other federal departments tn eke up the balance of input. 

Of the total input, approximately 16# is reprints resulting from federally 
sponsored research and published in professional journals; 10# is translations; 
and 3% is foreign language documents. 

Subject Analysis/Indexing: An average of 10 index terms selected from a 

thesarus of controlled descriptors, are assigned to each document record. Uncon- 
trolled keywords from the text are also part of the unit record. 

Searchable Data Elements: See page of data elements which follows: 

Abstracts: Abstracts provided for 73# of items. 

Time Span Available: Tapes available beginning with January 1970. 

Frequency of Tape Issue: Twice monthly. 

Average Number of Source Items Cited per Tape: 1800. 

Subscription Cost or Leasing Details: 

fl500/year for domestic subscription. 

Il835 for foreign subscriptions, including air mail coBtB. 

Tapes are available only on subscription basis. 

Test tape of USGRDR v. 69, no. 13, available at |25/tape. 

Software Availability: No programs for using the data will be made available 

by the Service. 

Type of Inhouse Service Offered: Limited search service now offered. Subject 
search done on only 20# of subject termB considered moBt central to document. Title 
listing and accession number provided. 

Publications Produced from Base by Originator: 

USGRDR , Clearinghouse Announcements in Science & Technology . U.S. Government 
Research & Development Reports Index (USQRDR-I). 

Further data in CFSTI brochure of 20 March 1970 which includes: Clearinghouse 

Announcement Journal Available on Magnotic Tape (Flyers); Subjects Covered; 

Data Elements; Description and Format for Clearinghouse USGRDR Magnetic Tape; Order 
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form for Sample Tap#. 

Additional Information frost National Technical Information Service, 5285 
Port Royal Road, Springfield, Virginia 22151 1 Attn: Marvin E. Wilson. 

CCMPBfDGC (Computerised Engineering Index) 

Dfiginaaring Index, Inc., New York, New York 

Subject Matters All fields of engineering and certain fields of applied 
science and management, pertinent quality research applications literature; assembles 
the various engineering disciplines side by side with interconnecting cross reference 

Types of Source Inputs More than 3500 sources of engineering literature — 
professional and trade journals, publications of engineering organisations, papers 
from conferences and symposia, books and other documents. 

Subject Analysls/lndexings It ems are assigned main headings, subheadings, and 
cross references, as required,' from "SHE* 1 (Subject Headings for Engineering), an 
authority list in use by Ei editorial staff. An average of 6 access terms (includ- 
ing subject headings and subheadings) are assigned input items. A maximum of 5 free 
language terms may be assigned. 

Searchable Data Elements: Users have the option of searching the complete 
record (full text search) or searching selectively on chosen data elements; Subject 
heading and subheading, document ID number, author (s), Ei abstract number, codes 
assigned to the CARD-A-LEKT service divisions, on most records the access words 
(which are cross references chosen for the printed version of Compendex), and on 
most records the free language terms. 

Abstracts: Abstracts are included. 

Time Span Available: 1969-1970 and subsequent years. 

Frequency of Tape Issue: Monthly. 

Average Number of Source Items Cited per Tape: 6000. 

Subscription Cost or Leasing Details: 

Tapes are available on lease only with subscription to Engin eering Index 
(monthly and annual issue). Cost for combined package 16800/year (for 1970: loOOO 
for lease; 1500 for combined annual and monthly indexes; 1300 for 12 tape reels). 
Commercial (for profit) organisations should contact Ei to determine Compenddx) 
availability and limitations on its use. 

Software Availability t Nona. (Tapfcla in TBCT-PAC input foraat. TDCT-PAC 
ia an IBM program for text eearcht available fro. IBM). 

Publicationa Produced fro. Base by Originator: 

None. 

For Additional Information: Compendex brochure, Engineering Index, Inc., 

345 Boat 47th Street, New York, New York 10017, Attn: John W. Carrigy, Telephone 
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(212) 752*6800. The brochure contains input format, data elements Information on 
TEXT-PAC and various details regarding this service* 

COMBINED SOURCE AND CITATION DATA TAPE 

Institute for Scientific Information, Philadelphia, Pennsylvania 

Subject Matters Broad interdisciplinary coverage of journal literature, 
including the primary journals of basic and applied science, engineering and 
technology, medicine, psychology and psychiatry, and the behavioral sciences. 

(Data appears on tape within 15 days after publication.) 

Types of Source Items Input: 2180 journals are input cover to cover 

and these articloa comprise 98# of the data base. The remaining 2# of the tape is 
from monographs, published proceedings, theses, etc. Certain regularly spearing 
symposia reports of proceedings, are routinely processed as they are published. 

50# of the base is from English language publications and 50# from non-Rigliah 
language publications. 

Subjects Analysis/Indexing: "Indexing" in the sense of assigning either 

keywords or descriptors is not done. It is suggested that in lieu of this field the 
reader consider searchable data elements. 

Searchable Data Elements: It should be noted that this tape includes not 
only the title of the original paper but all references cited in that paper. 
Searchable elements therefore include: 1) Citation line identifying any earlier 

published paper, book, patent, etc. 2) Words in a title including initial floating 
and terminal words stems or phrases. 3) Author currently publishing (the source 
author). 4) Authors whose earlier works have been cited (all cited authors). 5) 

The current journal title, i.e. the Journal in which the original article appeared. 
6) Earlier issue(s) of any journal(s) which has been cited in the references of this 
particular paper (cited journal). 7) Address where the work was done, i.e., the 
addresses of nil authors. 

Time Span Available:. 1964 to date. 

Frequency of Tape Issue: Weekly. 

Average Number of Source Items Cited per Tape: 6500 

Subscription Cost or Leasing Details: 

Subscription is f20,000/yoor; the use is restricted under subscription 
contract. 

Software Availability: Software packages ore available from the source of 

the tapes and details will be supplied upon inquiry. 

Additional Information: Institute for Scientific Information, 325 Chestnut 

Street, Philadelphia, Pennsylvania 19106, Attn: Melvin Veinstock. Telephone (215) 
923-3300. 
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MARC (MACHINE-READABLE CATALOG INQ) DISTRIBUTION SERVICE 
The Library of Congress, Washington, D.C. 

Subject Matter: Current English language monographic cataloging data. 

Type of Source Items Input: The entire input is taken from monographs which 

also includes government reports/documents and conference proceedings. 

Subject Analysis/Indexing: An average of 1.5 subject headings are assigned 

each item. The items are classified using The Library of Congress Classification 
Schedules. 

Searchable Data Elements: Each data element is searchable. Fixed and vari- 

able fields are included in the record. 

Time Span Available: The past 2 years. 1969*1970. 

Frequency of Tape Issue: Weekly 

Average Number of Source Items Cited Per Tape: 1,200. 

Subscription Cost or Leasing Details: Tapes are available on subscription 

for t800 per year. 

Software Availability: None. 

Type of In-House Service Offered: MARC tape BearcheB are performed in-house 

on a time available and cost recovery basis. 

Additional Information: The Library of Congress, Washington, D.C. 205^0, 

Attn: Henrietta D. Avram. Telephone (202) 426-6068. 

INFORMATION ON MARC MAY BE FOUND IN THE FOLLOWING PUBLICATIONS: 

The MARC Pilot Project: Final Report on a Project Sponsored by the Council on Library 
Re source s t Inc. 1968. (l53p77 For sale by the Superintendent of Documents, U.S. 
Government Printing Office, Washington, D.C. 20420, at 13*50 a copy. SC1.2:Ml8/2. 

A detailed description of the format, character setB, bibliographic codes, input 
procedures, and cost of production of MARC records during the experimental MARC Pilot 
Project. Report includes summary descriptions of computer programs used and, in an 
appendix, reports written by each of the participating libraries that received and 
used the weekly magnetic tapes. 

MARC Manuals Used by the Library of Congress ; prepared by Information Systems Office, 
Library of Congress. 2d ed. Chicago ALA, ISAD, 1970 For sale by the American 
Library Association, 50 East Huron Street, Chicago, Illinois 606 II, at 112*50 a copy. 

This 4-part volume of more than 300 pages contains three handbooks and one special 
study. The first handbook "Books: A MARC Format," provides specifications for 
magnetic tapes in the MARC II format. It is designed for programmers who will be 
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implementing systems using MARC records. The "Data Preparation Manual: MARC 
Editors." is a detailed guide to procedures followed by the MARC editors at The 
Library of Congress in preparing bibliographic records for conversion to mschine- 
readable fora. The ’Transcription Manual" provides siailar formation for the oper- 
ators of the magnetic tape typewriters used in the MARC ays tea. The special study, 
"Computer Magnetic Tape Usability Study," provides a list of data processing equip- 
sent which can handle MARC tapes. 

BOOKS 



A MARC Format : Specifications for Magnetic Tapes Containing Monographic Catalog 
Records in the MARC 11 For mat . Hth ed. April 1970* CTO p.) Distribution free upon ** 
request to 1C Card Division subscribers. 

This volume is a revision of the document formerly published under the title: 
Subscriber^ Guide to the MARC Distribution Service . It describes the tape and 
record formats, the character set, and the data fields in the MARC records. 

Serials: A MARC Fonaat . Washington, Library of Congress, 1970. (?2p.). For 
sale by the Superintendent of Documents, U.S. Government Printing Office, Washington, 
D.C. 20402, at f.?0 a copy. LC1.2:Kl8/7. 

Maps: A MARC Format . Washington, Library of Congress, 1970 (45p.) For sale by 
the Superintendent of Documents, U.S. Government Printing Office, Washington, D.C. 
20402, at 1.05 a copy. LC1.2:Hl8/6. 

Motion Pictures . Filmstrips t and Projected Meida Intended for Projection: A 
MARC Format, Washington, Library of Congress, (in preparation). 

CONCLUSION 

It is interesting to note that many of the information collections are avail- ' 
able in both microform and magnetic tape form. Which is selected depends on the end 
use, as well as on the availability of computers, etc. 

Most users of the listed services and other siailar services agree that 
internal operations are enhanced considerably from viewpoints of tine, money, man- 
power and convenience by using these "packages." However, a caution must be noted, 
that in the construction of these packages a generalization has been built in to 
make these acceptable to a relatively wide audience of subscribers. This generaliza- 
tion can prove to be disconcerting for people that have always done things in a 
precise fashion uniquely for their own organization needs. 

In brief, the "package" concept of information services can be of great as- 
sistance to getting started rapidly and/or expanding existing activities with wide 
yet directed coverage. 
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Session Eight - 

GOIMRCIALLY AVAILABLE SEBVICES 
Chaiman: Mr. D f G f Kingvill (South Africa) 



DR. E. GARFIELD (VS): The fact that beginning with January 1972 there will ba a 

weekly aubjact Index to Currant Con tan ta will hava a very significant Impact on com- 
putar services* One of the critical problems that computer oriented services have la 
timing* Computer aervicea hava become popular because In their own rather coatly way, 
they have overcome time barrlera which atood in the way of printed aervicea* When- 
ever traditional aervlcaa bscone competitive in tine and cost, and certainly they are 
competitive coat vise* the computer service will either have to Improve In terma of 
timing or In tarns of coat* 

We are the largest producer of a magnetic tape base In the world* We cover approxi- 
mately 400,000 articles per year* Many centers throughout the world, however, do not 
use our services. The main deterrent to greater use of our data base, which la Inter- 
national, multi-disciplinary, and complete In the coverage of 25,000 journals, la, 
among othar things, that additional journal coverage la desirable* 

The data base that we have constructed la baaed upon an analysis of citations which 
- shows those journals which are most frequently cited In the scientific literature* 1 
will soon be publishing a Hat of 500 most Important journals of science* 1 am leaving 
with Mr* Keren In Israel a copy of a list of journals which 1 presented a few weeks ago 
at the Cranfleld Conference In England* This list la Important to show what the usual 
patterns of literature are* But although these 500 journals account for about 75% of 
all relevant literature, 1 do not underestimate the importance of additional hundreds 
and thousands of journals, that people want covered In specific situations* 

Dr* Brown of the National Research Council of Canada gets from us a' data base 
covering 25,000 journals and 1 am sure he can justify the need to cover 500 or 1000 
more In various areas that ere of Interest to his users, and similarly Mr* Skov and 
others from Scandinavia* We know that the group at the Royal Institute of Technology 
In Sweden are processing journals on their own, to satisfy certain special needs* 

If you can 9 t justify the cost of Importing journals on your own, then It Is purely 
a matter of economics to come to 1S1 or any other data base supplier and say: here Is 
a list of journals that we want you to add and here Is the money to pay for It* 

We do what Is economically feasible but we cannot perform miracles* If we had 
large Government subsidies we could do more* 1 might point out that almost all the 
data basea that we are talking about today could not be offered at the prices at which 
they are offered ware It not for the feet that they are by-products of other services 
we ars producing like Science Citation Indsx, stc* After all. It Is Inconceivable that 
a data base which costs millions of dollars psr year to produce, as ours doss, could be 
sold for $8,000 to a doten or more users. 

As producers, ws are ready and able at any tims, whether through UNIS1ST or just In- 
formal discussions with our present or potential users, to add to this file any number 
of journals they want* 
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DR. J, BROWN (Canada): My paper was written several months ago. The service in 

Canada is developing rather rapidly, so that techniques have since changed somewhat 
and the figures stated in the paper have also changed. 

The National Science Library operates as a division of the National Research 
Council of Canada. The name "Library" 1 b misleading in this case, particularly if 
you Interpret the word library in a traditional manner. 

The National Science Library is really an information transferral agency whose 
job is to develop techniques to provide scientists, engineers and research workers 
with information on publications which they need in their day to day work. 

The National Science Library originated as a National Research Council Library, 
but since its Inception, it has been operated as a National Science Library. Its 
resources have been developed in very close collaboration with other major libraries 
in the country, ao that the resources and the services provided by the National Sci- 
ence Library complement and supplement resources and aervlcea of other libraries. 

The SDI service is regarded as an extension of the Information servlcea provided 
by the National Science Library* It Is a very small part of It. 

The SDI service, aa It is now, operated on an experimental baals for about three 
years. Before being offered aa a National Service we did a market analysis and at- 
tempted to find out of what use to the population it might be, what the Intereata of 
the users are, and what price they would be willing to pay for such a service. 

We are now using 8 tape aervlcea: Chemical Tltlea; CA Condensates; ISI Source and 
Citation Data Tape; INSFEC; Biological Abstracts Previews; MEDLARS; GEO. REF; and 
MARC Dlatrlbutlon Service. Recently we have added the ERICTAPES, the COMPENDEX, and 
Metals Abstracts Index tapes* That makea 11 tapea. 

We discovered very early that theae tapes are incompatible as to software and 
hardware, ao that one of our big problems waa to develop a data baae which would eli- 
minate the necessity to rewrite prograMsea every time we added a new data baae. 

A great deal of time, effort and money was spent in developing what we call a 
standard file format, which enables us to take any data base and reprogram it to this 
standard. 

The other problem we were faced with was the development of Interest profiles. 
Here again we discovered early in the game that it is very difficult to develop in- 
terest profiles by telephone, by correspondence and so on. What la really required 
is somebody who is familiar with the techniques of SDI, who has a subject backgrou : •.(' 
and who can sit down with the user and develop hla Interest profile. 

So we started a training program for what we call search editors. We offered a 
two day seminar in Ottawa to teach search editors how to use the SDI program. There 
was a great Interest in this and people were willing to come to Ottawa at their own 
expenae. 

To date we have trained about 250 people located in university libraries, company 
libraries, industrial libraries. Industrial Research Council libraries. Government 
libraries, etc. Training of search editors is a crucial matter in thla program and 
we are inclined to extend this further, and to train them on a more formal basis. 

They will be trained and paid by the National Science Library, and then located in 
specific reglona across Canada, where their job will be to conduct local seminars, 
aaalat in the development of Interest profiles, and travel around the country and 
aell the aervlce. 
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We have found that some of the bate search editors ire good reference librarians. 

The techniques in constructing interest profiles, whether for Mechanized or conven- 
tional systems, are the same. 

When the service was first offered on a national scale, over two years ago, we 
did not hsve good cost figures. The price we charged initially was $100 per year, 
plus an additional $15 if the person wanted to use more than one data base. We re- 
cently hsve changed these prices because we now have a very good picture of what it 
is costing us. We find that the cost at the present time is $2 to $2.50 per profile, 
per search. The new price schedule represents our total cost; that is, computer time, 
personnel, paper, mailing charges and so on. It does not Include the costs for leasing 
the tape services we are using. 

We reasoned that as when compiling a bibliography, we do not charge the user for 
the cost of the literature used, we should not charge the user for the cost of acqui- 
ring tape services. 

After the service had been in operation for a little over a year, we attempted to 
evaluate it. At present we have about 800 subscribers serving about 2000 people and 
we circulated a questionnaire asking which data bases they would like to have added. 

We also attempted to find out whether this service was saving the user any time. 

We came up with rather astounding figures. People who answered the question were 
saving about 39,000 manhours per year. Converted into dollars, this is quite a sub- 
stantial sum for relatively few people. 

Some of the users even indicated that they no longer used conventional search tech- 
niques which we were not too happy about. People should continue to use normal proce- 
dures for literature searches and our service as a supplement to catch material that 
might have been missed. 

One of the repercussions of our service is that the demand for literature loans 
has skyrocketed. When we initiated the service we promised subscribers that the Na- 
tional Science library would hold all the journals which are cited. There is nothing 
mote frustrating for a research worker receiving a list of citations than to find that 
he cannot get the original papers. 

The figures I have given here represent the use of a service which has not really 
been publicized. We were operating under austerity conditions so we had to be rather 
cautious, but we are now taking steps to expand. In the survey of user reactions we 
found that there was a great demand for retrospective searches. 

The only data bases which are available today to carry cut retrospective searches 
are MEDLARS and the ISI Citation Tapes. A current awareness search is quite a dif- 
ferent matter from a retrospective one. You cannot use the same program. You can 
run the tapes sequentially, but when you have a tape coming out monthly and you have 
five years accumulation, searching sixty tapes is a very costly business as far as 
computer time is concerned. 

So we are struggling to develop a technique whereby we can do retrospective searches 
at a reasonable cost. 

Mfl. fl. liKOV (Denmark) : I am going to represent 

the work done in Scandinavia regarding the real 
magnetic tapes. 

We hsve about 5 yesra experience in running 
advisable to make a common pricing policy. But 
actually costs to run a tape service. 
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CA Condensates were chosen for analyala with fifty or alternately three hundred 
annual search profiles. Based on practical experience we assumed that that cost of a 
service to Industry and research can be divided into four components: (i) disbursement 
to the supplier of the tape system; (ii) working expenses of the documentation center; 
(iii) cost of electronic data processing; (iv) expenses connected with the procurement 
of original material to the subscribers. 

We organised four working groups. The first had to find out what would be the 
cost of a system operating for either fifty or three hundred profiles. The second had 
to find out the cost of all the practical work of the documentation center; the organi- 
zing of the system, selling the profiles, instructing people, and so on. The third 
dealt with electronic processing, which can be done by having your own computer or by 
buying time. The fourth investigated the cost of material per profile. The findings 
are stated In my paper. 

The cost of one profile is about $800 where 50 profiles are served, and about 
$700 for 300 profiles. Costs will go up. Costs of documentation work depend on the 
efficiency of the service. To serve users well you must expect rising costs. Better 
programs, faster computers might make it easier in the future, but since the data base 
constantly Increases, more machine time will be required. 

The procurement of literature is indeed an increasingly expensive business. Sup- 
plying the customers with a copy of the original article requires time; we often have 
to write to remote cities, we have mailing expenses, maintenance of mailing lists, etc. 

Our figures are a valuable aid to policy makers who decide whether or not to pay 
for a service. In Copenhagen we buy electronic processing time from a big computer 
center. We intend to cooperate, as far as possible, internationally, with other cen- 
ters in Europe. I expect in the future that we shall exchange or buy profiles. 

There are now about forty to fifty tape systems in the world. From some of the 
systems we could serve only one or two profiles in Denmark. That would not pay, so 
we buy them. If our figures are correct, a profile in chemicals costs about $800. 

Three of these profiles cost lbout the same as a subscription to the printed period- 
ical. Does it pay for a firm to pay for a subscription or to use a few profiles? 

We tried to see it from the users point of view and we have made this survey to in- 
fluence to some extent the data base manufacturers and also to give some figures to 
our financial authorities. Every country will have its own policy regarding how much 
shall be subsidized by the State and how much shall be paid by customers. The figures 
presented here are based on Scandinavian prices, but they have some use for other 
countries as well in their attempts to come to a sensible pricing policy. 

MR m KIN GW ILL : The question is how many subscribers should a national agency have to 

make it worthwhile to subscribe to one of the commercial services and provide a ser- 
vice on a national basis as is done in several countries now. 

The other aspect which concerns smaller countries particularly is the question of 
making the conventional sources available. When you contemplate subscribing to these 
tape services and providing the SDI service based on them, you are faced wfth the 
problem of providing the hard copy as well. 

In South Africa this relates tc the problem of building up scientific and technical 
information sources. The commercial services aim, ao Dr. Garfield has pointed out, at 
providing information from the most important journals to the greatest number of users. 

In him paper, Mr. Doudnikoff says that the medium versus the discipline was 
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considered. The medium won out, Therefore, if discipline is o C key lattiwt, selec- 
tions mat bs node from ssch nsdiun. Microform* magnetic tapes* and hsvi espy puhli- 
cstions art tbs media. For the purpose of bis description the bard espy services ants 
being ds e m phas ised* H ow e ve r, I want to bring to your attention tbs results ef a sur- 
vey made by one of ay stuff concerning the problaa of tbs hard copy. 

She found that the tape of the American Geological Institute (OH* IP) ente rs - 
items from about 1000 journals. Of these, 300 have all articles entered. STH tepee 
of the AIF dealing with all areas of physics end astronomy enters articles from 
journals* Mathematical Offprint Service enters all articles from about 170 Journals. 
Mathematical Ievlevs reviews 1,100 journals of which 180 are entered cover to cover. 
Basic Journal Abstracts of Chemical Abstracts enters articles from 35 leading Jou r na ls 
in chemistry and chemical engineering. Chemical Titles processes 700 and the CITE tape 
of Engineering Index covers 300 "core" technological journals in plastics and electro- 
nics engineering. 



This gives you 



idea of the problem we are faced with. 



MR* F* URBESMI (UK): To Dr * Garfield* 1 was rather distressed by your forecast that 
the costs are going up. I should have thought that in an economic organisation* an 
efficient organisation, the larger the turnover, the lower the cost. Vfay not? 

DR* GARFIELD: To Mr * Liebeony . I do not know where you got the impression that 1 said 
the prices are going up. On the contrary, what I said was that if the sise of tbs data 
base is to be increased beyond what it already is, we will be glad to enter into in any 
kind of cooperative arrangements with the tape suppliers whereby we can add to the al- 
ready existing coverage of 2500 journals, if they want to help support the additional 
cost of input. I did not say that ISI had stopped adding journals, 

Ve operate the ASGA system. Anyone who wants to, can subscribe tomorrow for about 
$135 a year in Israel, $115 in the United States. Incidentally, Mr, Skew might compare 
that to his cost of $800. I do not know the details of what he is searching for $800, 
but it seems to me that since ve have a range of profile users beginning at $100 and 
going up to $2000 per year, that our average custoner spends about $150 per year per 
profile. 1 an confident that we can provide weekly turnouts where the customer cam 
sit at his desk and throw his own computer out. 

In the first year of operation, we covered 600 journals. Without increasing the 
price during six years, ve have increased the coverage to 2500 journals* That* In 
spite of inflation, is not an increase, but a very significant decrease in price* 

There is no pri< e increase contemplated for ASCA in the coming year. Os the con- 
trary, ISI tapes will certainly not go up, and nay even go down* So the number of 
users of our tapes definitely affects the future and present pricing* Every tine we 
get a new user, whether it is at $150 a year or $2000 a year, there is an allocation 
made for new journals. Whenever we get a substantial new user, let us say s subscrip- 
tion for tapes from somebody interested in petroleum, we would stipulate the coverage 
of X number of petroleum journals for that particular user. 



MR* H. SCHUR (UK): To Dr* Brown and Mr* Skov , Could you give us sons def I Is of who 
your users are? Where do they work? Are they at universities, or in industry, and 
if in industry, are they in the large firms or in the small ones? 1 am particularly 
interested because in Denmark and in Canada the technical information ssrvics is 
something on the lines of the industrial liaison officers in ths HI, 1 would* there- 
fore, like to know whether any computerised services are also used by the TIB services 
end not just by the universities. 
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DR . BROWN: To Mr, Schur . The SDI service operated by the National Science Library is 
a very personalized service. At present, users from industry comprise about 20X of 
the total user population. This figure is going up. The data bases that were avail- 
able at the time we started our service were heavily oriented towards the basic sci- 
ences. As we are now gradually adding other data bases, such as COMPENDEX, INSPEC, 
and so on, we are acquiring more and more users from industry. 

TIS - Technical Information Service - is operated by the National Research Council 
and works in close cooperation with the National Science Library. TIS has also develop- 
ed an SDI service based on the programs developed by the National Science Library. It 
is a less sophisticated system because it is directed towards an individual industry, 
a textile or furniture manufacturing industry, etc. 

I suspect that many of the individual users of the NSL SDI service are also pro- 
fiting by the SDI services offered by TIS. 

An amusing thing came up in connection with cost. We have a very sophisticated 
system which searches the contents of about 9000 journals at a fantastic speed, over 
weekends, and the print-outs are sent out every Monday morning. We ran into a bottle- 
neck because we had great difficulty in speeding up the mailing of these print-outs. 

We had to get a rather large clerical staff to tear the sheets apart and stuff them 
in envelopes. However, the problem is not yet solved. 

MR , SKOV : To Mr • Schur • There is an approximate 50-50 distribution between research 
people and people from industry. Our system in Denmark hinders researchers from 
using more profiles because they have very small local budgets for purchasing infor- 
mation. In England students in their last year can get profiles without fee to get 
accustomed to this new system. Unfortunately we in Denmark cannot afford that for 
the time being. 

MR, D,H . BARLOW (UK) : To Mr, Skov, I think one of the main points that came out in 
your remarks, was the ratio between printed publications and tape services. Tape 
services are generally a spin off. As the number of centers using tapes increase, 
so will the provision of information to the various users, who are traditionally 
users of the printed publications. I think there is a very real problem that has to 
be faced by the users of inf ormation, that the producers have to maintain the balance 
between information produced through retail outlets and through wholesale. This has 
not been taken into account in previous studies. 

MR, SKOV; To Mr, Barlow, As to the modem systems as against traditional ones, we 
have had some customers who have cancelled their subscriptions for quite a lot of 
periodicals. Maybe we killed the basis wc work on. I do not know what will happen. 

The new system has great appeal to some people and many who never did very much con- 
ventional reading, do read now. 

DR, GARFIELD: Mr. Kingwill mentioned the difficulty in deciding when a center is 

ready for tapes. How many profiles does this require? There are many centers which 
can Justify tapes for a very small number of profiles for reasons of confidentiality. 

We have centers using our tapes who may have as few as 30 or 40 users. They do not 
want to buy a commercial service because they would have to submit their profiles 
which they prefer not to do. It is true that by examining profiles you can find 
out quite a bit about what the firm is doing. Some military organization might not 
particularly relish the idea. Also, the size of the individual profiles is very 
significant. 



664 




DiQQUQ8iori8 



The final point regards the implementation of tape services. Every Installation 
that ve make, unless the user is sophisticated enough, or prefers not to have it, is 
supported by a training program. Ve do not sell any service that la not backed up 
by the proper Installation and continuing training. Even the customer of a $150 
profile gets help in the construction of his profile. 

1 might add that I spend more than fifty percent of my time on the problem of 
the so-called leas developed countries. 1 do not consider Israel one of these coun- 
tries, but in some South American or African countries, 1 would never recowend the 
installation of a tape system. They can, through our own commercially available 
services, buy, for $150 per profile, something which will be much easier for them 
to use and thus not be confronted with all the problems of programing, training com- 
puter people, training information scientists, etc. On the other hand. If there are 
political or other reasons why they must have a computerised service, we will send 
key people to those places to help then install those systems. 
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EVENT INTERACTION ANALYSIS OF ISRAEL AND OTHER MIDDLE 
EAST COUNTRIES USING THE FACILITIES AND DATA OF THE 
WORLD EVENT/INTERACTION SURVEY (WEIS) 

Emil Schafer 

University Computi ng Center 
University of Southern California 



SYNOPSIS 

The use of four generalized programming languages to collect, structure, and 
analyze data related to political and other events or interactions among 
countries of the world is described. The data management language GIS is 
used to construct and maintain hierarchic files from which data is condition- 
ally selected for statistical analysis using BI-MED and SPSS. Event data 
for Israel , UAR, Jordan, Syria, Lebonan, and Iraq are analyzed for the 
4-year period 1966-69; chronology of events for June 1967 is listed, and 
frequency distributions of acts by country are plotted for the same month. 

Introduction 

The study of international political systems has become a major area of research in the 
field of international politics. This paper is a report of a project to incorporate such data into 
files capable of being conditionally queried for quantitative data analysis and qualitative review 
of event interactions. The relative ease and power of high-level data management languages to- 
gether with packaged statistical programs are illustrated. These methods and techniques have 
been taught to classes of graduate students who have been able to use them to describe and con- 
struct data bases for individual and group research projects. In this way, students and other re- 
searchers with no prior exposure to data processing have been able to do fairly sophisticated data 
manipulation and to effectively increase their time available for interpretive study. 

The World Event/Interaction Survey (WEIS) is a program of education and research in 
the Department of International Relations at the University of Southern California whose purpose 
is to develop the ability to describe, trace, compare, explain, and hopefully, to predict, under 
theoretical discipline, a particular range of international behavior by means of indicator data. 

The research is expected to contribute findings that might shed light on external performance 
traits and tendencies of countries, and also on configurations of actions and responses in the 
political exchanges among countries. The central topics are: how are problems coped with, and 
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how are conflicts generated and resolved. 

WEIS is also a data gathering and data analysis program to analyze structures and 
processes of actions and responses at what is defined as the "national control level " of interna- 
tional politics. Each data item is defined as a report of an international event, e.g. , 

ITALY EXTENDED A TWO MILLION DOLLAR LOAN TO MOROCCO. 

SYRIA REPORTED THAT SYRIAN PLANES SHOT DOWN 5 ISRAELI JETS. 

There are almost 35,000 such data items presently in the WEIS files. These have been extract- 
ed from the descriptive and explanatory materials of news stories contained in daily newspapers. 
The problems, issues, contests, conflicts, and adjustments which arise in international political 
relations are the subject of WEIS data gathering and analysis. Each such data item is defined as 
an event/interaction. These are included as long as they relate to specific and discrete acts 
which have been committed. Excluded are editorial reports, speculations on future international 
affairs, and generalized reports on series of events. The WEIS is primarily interested in what 
nations do as they act and respond in international political situations. 



Source Data Descriptions 



Two source tape files were received for this demonstration project: the WEIS Analytic 
fits and the WEIS Descriptive file. The Analytic file is a file whose fields are compatible with 
access by several statistical programs; the Descriptive file contains short abstracts or descrip- 
tions of the event interactions. For each record in the Analytic file, there should be a corres- 
ponding record in the Descriptive file. These two source files were constructed of 80-column 
(or 80-byte) card images containing the fields shown in Table 1. 



ANALYTIC FILE FIELDS 


DESCRIPTIVE FILE FIELDS 


YEAR 


MONTH 


MONTH 


DAY 


DAY 


YEAR 


ACTOR 


DESCRIPT 


EVENT 


COLLID 


TARGET 


SERIALNO 


SOURCE 


LINEID 


CODER 


DECKTYPE 


ARENA 




COLLID 




SERIALNO 




LINEID 




DECKTYPE 





Table 1. Analytic and Descriptive Source File Fields. 
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The fields ACTOR and TARGET contain numerical codes which designate countries 
initiating and receiving some action; the type of action is coded in EVENT. The field ARENA 
localizes the area in which the event took place, while SOURCE codes the newspaper source of 
the data. The collection identification is encoded in COLLID, while the individual coding the 
raw data is designated in CODER. The field LINEID has meaning only for the Descriptive file 
where several source cards may constitute one logical record. DECKTYPE identifies the file: 
a "1" is the Analytic file; a "2" is the Descriptive file. The fields YEAR. .MONTH, DAY, and 
SERIAI.NO are self-evident. (Details of the coding used for both files are contained in Ref. 1). 
Figure 1 , below, contains several examples of the types of coding used . 



[ An example showing the same record, as it might be coded for both the Analytic and Descriptive 
I formats is shown in Fig. 2. 

I 



DESCRIPTIVE 




10 17 68 VTN REJECTS USA DEMAND FOR RECIPROCITY FOR 
10 17 68 BOMBINGS OF VTN. 


12064212 

12064222 


ANALYTIC 




68 10 17 816 111 002 01 26 025 


12064201 



Fig. 2. Source Analytic and Descriptive card images. 



ACTOR or TARGET EVENT/INTERACTIONS 



002 USA 
070 Mexico 
305 Austria 
651 UAR 
666 Israel 



011 Surrender; yield to order; 
submit to arrest . 

012 Yield position; retreat; 
evacuate. 

013 Admit wrongdoing; 
retract statement. 



SOURCE 

01 New York Times 
05 Middle East Journal 
Chronologies 
12 Times of India 
17 Middle East Journal 



COLLID 


ARENA 


02 


Berlin crisis 


010 Arab- Israeli conflict (general) 


17 


Middle East 


013 Events during 1967 


12 


WEIS 1968 


020 Vietnam conflict 


24 


Sino-Soviet interactions 


050 Sino-Soviet conflict 
1 60 Red Guard activities 



Fig. 1 . Representative code and decode values for Analytic and Descriptive fields. 
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Creating the Analytic Data Set 

To create the new Analytic data set, two file descriptions were written using the GIS 
data description language. The first DDT (or Data Description Table) describes the source file; 
the second, the new file to be created. The DDT for the source file is shown in Fig. 3, while 
that for the new Analytic file is shown in Fig. 4. 



The new Analytic file is described as a one- 
level GIS file, like the source file, except in 
two respects: (1) Each record is compressed 
in size (from 80 to 30 bytes) to remove inter- 
vening blanks, and (2) The file is defined to 
be a generation data group (GDG) which makes 
it possible to use GIS for easy file mainten- 
ance functions. It was also desired to provide 
automatic field decoding facilities for all the 
coded fields . This can be done by including 
decode tables in the DDT, (see Fig.4). 

For example, if the coded value "09" is pre- 
sented for the field MONTH, the decode func- 
tion will print out "SEPTEMBER". Again, if 
the field ACTOR has the coded value "130", 
the decode function will generate "ECU", the 
acronym for the country Ecu dor . The keyword 
ENTRIES=2 in the DAtM statement indicates 
that GIS will maintain two generations of the 
file. The SEGM or segment statement serves 
to collect all the field statements above it into 
a GIS segment. The decode functions may be 
easily controlled by the user, i.e. , they may 
be turned off when not wanted. 

By providing additional DDT's 
for the same file (the same DSNAME), it is 
possible to readily describe the file different- 
ly. For example, if one would want to have 
the field names print out in words of another 
language, a simple addition to the field state- 
ment makes this feasible. 

The final step in the creation 
of tlie new Analytic file was to write a small 
program in the GIS procedural language to de- 
fine the mapping of fields from one file to another. This mapping is usually partitioned on a 
segment basis . The conversion program, as shown in Fig. 5 , reads source file records into a 
special GIS file (H0LD5) where they may be sorted. The word QUERY initiates all the process- 
es of locating the data set in the system catalog, requesting tape mounts, opening files, etc. 
The word SORT defines and initiates the sorting actions. Finally, tlie word CREATE initiates 



DPT; 


| 


FILE: ’!AME='..'E 1 GATAP; 


1 


FLD: HAKE-YEAR, 


LENGTH"”; 1 


FLP: HAI-T-BLANKl, 


LENGTH- 1; I 


F LH : NAI’E-HONTII, 


LEHr,TII=2; I 


FLD: ! , /. , ‘!>CLA?!K2, 


LEIIGTII=1; 1 


FLU: i.’/ViODAY, 


LENOTH-2; 1 


FLP: NAI‘E-DLAN!C3, 


LENGTH-1; 1 


FLP: ;!AHE»ACTOR, 


LENGTH" 3; 1 


FLP: VAi:r=PLAf.T.l;, 


LENGT!! = 1; 1 


FLP: mahe-CVENT, 


LENGT!1 = 3; 1 


FLP: !!Ai'E*P»LA!.'K5, 


LENGTII-li I 


FLP: 1,'AI‘E "TARGET, 


LENGTH-3; 1 


FLP: NAVONLANKC , 


LE!!PTI! = S; 1 


FLP: MAHE-SOUP.CE, 


LENGTH"”; I 


FLP: ;.'At't:=r,l.A:i!C7, 


LENGTH"]; 1 


FLP: HAHE-CODER, 


I.E!!GT!! = 2; 1 


FLP: ! !AHE»BLAHK", 


LENGTH"?" j 1 


FLD: '.'AHE-ARENA, 


LENGTH"?; 1 


FLD: : , . , ‘T. = DI.A;*!'.", 


LENGTH-1; 


FLP: NAME-COLLID, 


LENGTH"”; 1 


FLP: MA.HC-SERIALIIO, 


LENGTH"*:; 1 


FLP:. MAI5E-LINEID, 


LEMGT! ! = 1 ; 1 


FLD: !!A!'E"PECKTYPE, 


LENGTH-1; 1 


SCGHtHAPC-AMALYTIC, 


LEVEL=P0, ; 1 


ftype=p.ecop.d, unhid 


=Y , ; . 1 


PSORT-YEAR, A, SERIAL 


NO, A; I 


* s I 

DAT!!: DSORG=FS, CPEATE=l!0, CATLG-; 1 


PYE3, UN 1 T = 2li 00, PCD=(PEN=3), ; 1 


v LP.ECL = C0, RLKS 1 ZE = 


3200, n e C FM=; 1 


SFD, VCLUI1E-SER-0010CG, ; 1 


f DS MAI IE -WE 1 S DAT A; 




END; 





Fig. 3. DDT for the Source Analytic File. 
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DI)T 5 

* THIS DOT OFF INFS THE WEIS ANALYTIC FILE! 

F ILF : NAME* WE I SAGI S; 

FLOS NANE*Y£M* LENGTH-2, JUST-L ; 

DFCD S TYPSPC-LKUP, LGTHA-2 • LGTHF-4, EE0VAL-66 1 1966 ,67, 1967,6ft, 196 A, | 
#69,1969, TO, 1070,71, 1971 .72, 1072; 

FLO: NAME -MONTH . LENGTH-2 , JUST-LS 

DEED: TYPSPC-LKUP, LGTHA-2, LGTHF-9, EEOVAL*? 

#01 
#02 
#01 
#04 
#05 
#06 
#07 
# 0 # 

#09 
#10 
#11 
#12 



JANUARY « 

FEBRUARY, 

MARCH , 

APRIL , 

MAY , 

JUNE , 

JULY ,; 

AUGUST ,{ 

SFPTEMBER, $ 

OCTOBER 
NOVEMBER 
DECEMBER ; 

FLO: NAME-DAY. LENGTH-2, JUST-L; 

FLD: NAME- ACTOR . LENGTH-3 , JUST-L; 

OECD: TYPSPC-LKUP, LGTHA-3 , LGTHF-3, CONVA-S, CONVF-S, ; 

#EE0VAL*002, USA,020 ,CAN ,040, CUB, 041 ,HA1 ,042 , DOM, 051 , JAM, 052, TRI ,053,; 

# BAR, 070, HEX, 090, GUA, 091 , HON ,092 , ELS, 093, NIC , 094, COS, 095 , PAN, 100, COL, I 
101 , YEN, 110, GUY, 130, ECU, 135, PER, 140, BRA, 145, BHL, 150, PAR, 155, CHL, 160, ; 
165,URUtl9B,AF^<l9,OAS,200,UNK,205,IRF,210,NTH,2n^OFLf212,LUX,; 
FLti.- ^V < ?71 f M0C'^ .^e>0'«5,SWZ ««« 

FLO: LENGVxX.305 

FLO* N*MEiW<Tmi LENGTH-, ^MP^f-L, HEAlu ^O^.v'tVFC* 

• ; ' / v ' ^ 

SEGN:NAH£-ANALVT1C, LEVEL -00. TVPE-RECORO, UNIND-Y* t 
• SORT-YEAR, A, SER I ALNO. At 

*t 

6S«R4-pS, CREATE -YES, CATLG-YES, DSNAME-HI83ARU.NEISANTR, 

NTR I ES-ToPCB-l OEN-3 ) • LRECL-30, 6LKSI2E-3000, RECFM-FBt 

enBT 



i 265, 



Flg.A . ODT for the new Analytic file incorporating decode tables for all fields. 



the creation of the new file as defined by the mapping of the STRUCTURE statement. The verb 
INSERT implies the insertion of the entire segment on a non-duplicating basis. The last step 
is not a part of the file conversion; it merely lists out the first 25 records of the new file for 
visual inspection. Fig. 6 shows a brief listing of these records in an automatic GIS format pro- 
vided from the information stored in the DDT. Note that the decode table for the field ARENA 
includes a print value of FIELD EMPTY for an empty field; otherwise GIS will indicate this 
condition by a string of asterisks. If there is no decode function for a given argument in the de- 
code table, GIS will indicate this with a string of pound signs (####). As part of the file crea- 
tion process, GIS performs the cataloging of the new data set. 
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Creating the Descriptive Data Set 

In a manner parallel to that 
for the Analytic file discussed above, ten 
DDT's were written for the Descriptive files. 
The first was to describe the source file; 
the second, to describe a GIS hierarchic 
file . The abjective here was to create a 
file of varying record length due to a vari- 
able number of lines of abstract or descrip- 
tion. This was accnplished by placing the 
lines of descriptive text in subordinate seg- 
ments; subordinate, that is , to a master or 
zero-level segment containing record identi- 
fication and other information. A count field 
in this segment keeps count of the number 
of lines of description in each record. The 
DDT for the new hierarchic file (Fig.7), 
describes a variable length, blocked record, 
containing a variable number (up to nine) 
of fixed length segments called SUB, each 
of which contains one line of description, 
and one fixed length segment named MSTR. 



QUERY WEISATAP 
LOCATE RECORD 
MOLD HOL05 RECORD 
EXHAUST RECORD 




SORT H0L05 ASC YEAR, 


SERIALNO 


CREATE WEISAGIS FROM H0L05 
STRUCTURE ANALYTIC FROM ANALYTIC 
INSERT ANALYTIC 


QUERY WEI SAG 1 S ( ♦ 1 ) 
LOCATE RECORD 
LIST RECORD 
EXHAUST RECORD OR 25 
END PROCEDURE 





Fig. 5 . f hree-step GIS procedure to create 
new Analytic data set. Fourth step only 
reads 25 records of new file. 
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Dpi ; 

FILF»NAMF*wriSnr.t${ 

FLD: NAMF- *YE AW « LENGTH®? t JHST=L; 

DECD:TYPSPr.= lKI)P, LGTHA®? • LGTHF=4» FG0VAI. = 66t!9/>6,67 * I9A 7, 
#69, 1 9r,o,70,l 97 0,7 1. I '>71,72,197?; 

FLO: NAMF=SFP I ALNO, LENGTH®'*, Jl)ST®L, HFAOFP=SER I AL NH.; 
cm* WAMC*;<nMTII. LENGTH*? • JUST = L ! 



FLO: NAME -DAY. LENGTH®? , JUST-L ; 

FLO: NAME »COLL 1 0. LENGTH®?, JUST=L, HFAUnR=COLLGCT ION IDS 
OECD: TYPSPC=LKUP, LGTIU=2, LGTHF® A0» CONVA=S, CONVF=S, EEDVAL® ; 
#01 , S INO-INDI AN BORDER WAR (HOGGARD) ,; 

#02, BERLIN CRISIS I MCCLELLAND I ,i 




#23,16 INTERNATIONAL CONFLICTS I F ITZSI MMCINS ) , ; 
#24, SINd- INDIAN INTERACTIONS (HOGGARD) •; 

#27, TIMES OF INDIA SAMPLE J 



SEGM:NAMF,»MSTR, LEVEL=00, TYPE=RECORl>, UNIND=Y , SORT=YEAR , A, SFRI ALNO, A 



DATM:DSORG®PS, CREATE=YES, CATLG=YFS, FNTRI FS=3, OSNAMF=H IB3ARU. I NTRGL 
#BLKS I ZF=6524» LRECL=652, RFCFM® VB, UNIT=2400, DCB=( DF.N=.3 » ; 



FLi): NAME» DF SCRIPT. LENGTH=A2 , JUST=L, HFADrR=DESCRI PT ION; 
FLD: NAMEaLINFID. LENGTH®!, JUST=L, HFADER=L I NE 10? 



SJEGM: NAME®SUB, LEVEL«Ol , TYPE=TRA ILR, OPTION«CNT, ; 

#OPTFNM*DFSCCNT , UN I NO® Y, SORT=L I NE ID , A} I ENOJ. 

Fig. 7. DDT for new hierarchic GIS Descriptive file indicating a 3-generation data set. 

Like the DDT for the Analytic file, the DDT for the Descriptive fiie contains many 
decode tables; note the decode table for COLLID. The file conversion procedure is very similar 
to that for the Analytic file with one important exception: since there Is now a subordinate seg- 
ment , the mapping statement is "up" from the lowest level segment (the SUB segment). There 
is only one structure partition, STRUCTURE SUB FROM MSTR, where SUB is the name of the 
segment in the file to be created, and MSTR is the name of the source file segment (the source 
file only has one segment, i .e. , MSTR). The entire procedure to create this two-level hierarch- 
ic file is shown in Fig.8. 





FLD: NAMF®OESCCNT. LENGTH*! » JUST=R, HEAOGR=NUMBER DFSC LINES; 
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IX Proaeeeing for Automation - I 



Information Retrieval with GIS 



QUERY WEISDSOR; 

READ RECORD; 

HOLD H0LD20 RECORD; 
EXHAUST RECORD; 



created, one might be interested in several kinds 
of "information retrieval". First, are the extrac- 
tion of subsets from the Analytic file in a format 
amenable to further processing by one or more 
statistical programs. Second, one might wish to 
be able to read a chronology composed of abstracts 
from a certain time sequence, e.g. , all events 
during May 1968. Third, one might wish to 
query both files using the Analytic file as an in- 
dex or parameter list to retrieve particular records 
from the Descriptive file. Information retrieval 
for any of these three purposes is relatively easy 
using the GIS data management language. In the 
examples which follow, we will demonstrate 
information retrieval for each of the three types . 



Using the two data sets just 



SORT H0LD20 ASC YEAR, ; 
4SERIALN0, LINEID; 



CREATE WEISDGIS FROM HOLD20; 
STRUCTURE SUB FROM MSTR; 
INSERT SUB; 



QUERY WEISDGI S(*l) ; 
READ RECORD; 

LIST RECORD; 

EXHAUST RECORD OR 25; 
END PROCEDURE; 



Fig. 8. Three-step GIS procedure to 



create new hierarchic Descriptive data set. (,n these examples, processing was always in the 
(Fourth step only reads first 25 records). batch mode; GIS has teleprocessing facilities 



SUB Segments; • 

Fig. 9. Pictorial representation of the ‘VEIS Descriptive file hiervchic structure. 

Action Within the Middle East Group, 1966-69 
« 

* In this first example of information retrieval, we wish to construct a table showing 
frequency and percentage of event/interactions among the six countries ISR, UAR, SYR, JOR, 
LEB, and IRQ for the four year period 1966-69. That is, we wish to generate a subset from 
the WHS Analytic file where the ACTOR and TARGET is any of these countries. The subset 
will then be processed by a statistical program to generate tire required table. The first step is 
to write a DDT (a few minutes work), which describes the file (subset) to be created (see Fig. 10). 
Note that this file includes only selected fields. Since the mapping defined by the STRUCTURE 
statement is by field name, within the named segment, only the named fields <re mapped into 
the subset file. The second step was to write an SPSS (Statistical Package for the Social 
Science) program to construct the required table. (See Fig. 11). The third step was the writing 
of a simple procedure to create the subset of the Analytic file. 



but they were not implemented at the time these 
programs were run.) 



00 Level I YEAR I SERIALNO I MONTH I DAY I COLLIDI PESCCN1 



MSTR Segment DESCRIPTION (line 1 ) UNEIE 



DESCRIPTION (line 1) 



LINEID 




DESCRIPTION (line 2) 



LINEID 



DESCRIPTION (line 3) 



LINEID I 
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DDT; 

FILE:NAME-WEISDEM1; 

FLD: NAME-YEAR, LENGTH-2; 

FLD: NAME-MONTH, LENGTH-2; 

FLD: NAME-DAY, LENGTH-2; 

FLD: NAME-ACTOR, LENGTH-3; 

FLD: NAME-EVENT, LENGTH-3; 

FLD: NAME-TARGET, LENGTH-3; 
SEGM: NAME-DEMOl, LEVEL-00, ; 
#TYPE«RECORD, UNIND-N, SORT-; 
#YEAR,A; 

DATM:DSORG-PS, CREATE-YES, ; 

* CAT LG -YES, UNIT-2400, ; 
IRECFM-FB, LRECL-15, BLKSIZE-; 
ff 3000, DCB-(DEN-3), ; 
#DSNAME-H183ARU.SUBSET1; 

END; 



QUERY UE I SAG 15; 

LOCATE RECORD; 

IF ACTOR EQ '045', '651', '652', ; 

^‘ , G60 , , , 6G3 , , , 6G6 , ; 

AMD TARGET EQ * 645 ' , ' 551' , ' 652 ' , ; 

#'660', '663', * C66 ' ; 
HOLD HOLD1 RECORD; 

EXHAUST RECORD; 

SORT HOLD1 ASC YEAR, SERIALHO; 

CREATE VIE ISO EMI FROM I SOLDI; 
STRUCTURE DEMO FROM ANALYTIC; 
APPEND DEMO; 

END PROCEDURE; 



Fig. 10. DDT defining file for use by an SPSS program. The file creation procedure (on the 
right), selects records conditionally from tiie Analytic file. 



// EXEC SPSS360, TIME-3, REGION-150K, PARM-' 10000' 






//GO. FT02F00 1 


DD DUMMY 






//GO. FT08F001 
//SYSIN DD * 


DD DS N-H18 3 ARU. SUBS ET1, D ISP-OLD 






RUN NAME 


WE IS DEMO 






FILE NAME 


WEIS DATA 






VARIABLE LIST 


YEAR, MONTH, DAY, ACTOR, EVENT, TARGET 






INPUT MEDIUM 


TAPE 






* OF CASES 


ESTIMATED 2000 






INPUT FORMAT 


FIXED (3F2.0,F3.0,F2.0,1X,F3.0) 






RECODE 


ACTOR, TARGET (1 THRU G44,646 THRU 650,653 THRU 
GG1,6G2,GG4,GC5-(J) <G45-6)( 051-2) (652-3) (660-5) 


G59, 




(663-4) (GG6-1) (G67 THkU HIGHEST-0) 






RECODE 


YEAR(6G=1) (67-2) (68-3) (69-4) 






VALUE LABELS 


ACTOR, TARGET (0) OTHER (1) ISR (2) 


UAR (3) SYR 


(4) 




JOR (5) LE3 (G) IRQ /YEAR (1) 19G6 
(3) 1963 (4) 19G9 


(2) 19G7 




PRINT FORMATS 


YEAR TO TARGET (0) 






FASTABS 


VARI ABLES-ACTOR(U,6)/YEAR(l, 4) 
TABLES-ACTOR BY YEAR 






OPTIONS 

READ INPUT DATA 


3,5 







Fig. 11 . SPSS program to process data in SUBSET1 file. The first RECODE statement 
assigns values to the ordinates, the second to the abcissa. VALUE LABELS assians row and 
column labels; FASTABS dimensions the table array; title is assigned by TABLES^ statement. 



IX Processing for Automation - I 



YEAR 

COUNT I 

COL PCT I 1966 1967 196,8 1969 ROW . ! 

I TOTAL 

I II 2 1 3 1 4 1 

ACTOR 1 1 1 1 —I ■! 

II 56 I 125 I 158 I 299 I 636' 

1 SR I 35.1 I 39.2 I 43.6 I 47.1 I 43.3 1 

-» 1 1 1 — ~ — i ; 

21 26 I 69 I 68 I 186 I 349 i 

UAR ! 16.9 .1 21.6 I 16.8 1 29.3 { I 23.7 j 



3 1 40 I 54 I 16 I 35 I 145 

SYR I 26.0 I 16.9 I 4.4 t 5.5 I 9.9 

-I 1 1 1 1 

4 1 23 I 55 I 103 I 72 I 253 

JQR I 14.R I 17.2 I 20.5 I 11.3 I 17.2 

-I 1 1 1 1 i 

5 1 II 3 1 14 I 26 I 44 

LEB I 0.6 I 0.9 I 3.9 I 4.1 I 3.0 

-1 j j j j 

6 1 10 I 13 I 3 I 17 I 43 

IRQ I 6.5 I 4.1 I 0.8 I 2.7 I 2.9 

-I 1 1 t 1 1 

COLUMN 154 319 362 635 1470 

TOTAL 10.5 21.7 24.6 43.2 100.0 



Fig. 12. Table constructed by SPSS program of Fig. 11 showing frequencies and percentages 
of event/interactions among the six countries ISR, UAR, SYR, JOR, LEB, <md IRQ for the 
period 1966-69. Each element of the array contains absolute counts and column percentages. 
Total number of event/interactions Is 1470. 



This table illustrates information retrieval and organization using GIS and SPSS. 

Data in this table may be interpreted as follows: During 1966 ISR was the ACTOR country 
among the group 35 .1%‘of the time with 54 out of 154 yearly events. During that year SYR 
was in second place with 40 events and 26.0% of the action. However, during 1967, second 
place went to UAR with 21 .6% of the event action. In 1968, JOR overtook UAR for second 
place with 28.5% of the event activity. 

The table In Fig. 12 shows e\e nt/interactions on an annual basis. In order to pre- 
pare for some statistical programs, another table was prepared, like that in Flg.l 2, but indicating 
event/interactions on a monthly basis. A typical row in the table of crosstabulation totals by 
country would be: 



BY ACTOR 



ISR UAR SYR JOR LEB IRQ 



ROW TOT 
12 



5 
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Using the data in the "Monthly by Actor" table, punched cards equivalent to each 
row of the table were prepared for each month in which the row total was not zero. This set of 
data was input to the BI-MED program BMD03D (Correlation with deletion) which computes a 
simple correlation matrix, omitting values of variables which the user may specify. The output 
from this program includes (1) Correlation matrix printed, and (2) Number of pairs of observations 
used in computing each correlation coefficient. Another similar program, BMD02D (Correlation 
with transgeneration) computes correlation coeficients, averages, and measures of dispersion on 
entering variables and/or transgenerated variables from selected cases whose values for specified 
■variables have a precise logical relationship in agreement with a specified Boolean expression. 
Output from this program includes (1) Sums, (2) Means, (3) Cross-product deviations, (4) Stand- 
ard deviations, (5) Variance-covariance matrix, and (6) Correlation matrix. Optional output in- 
cludes a one-page cross-tabulation plot of any two variables. 



Regression Analysis of GIS-Generated Data Set 

This second example of information retrieval, data organization, and data analysis 
using GrS-created files and the SPSS statistical package, uses the same data set created for 
the previous example (See Fig. 10). What is desired now, however, is to compae ISR as ACTOR 
to ISR as TARGET (ISR output to ISR intake) from three countries UAR, JOR, and SYR , for 
month-by-month distributions. For this data, multiple and stepwise regression regressions were 



// EXEC S PSS360, TIME = 4,REGION = 200K / PARM= ' 40000 1 



RUN NAME 
VARIABLE LIST 
SUBFILE LIST 
# OF CASES 
PROCESS SBFILES 
INPUT FORMAT 
REGRESSION 



READ INPUT DATA 



WE I SDEMO--P.EGRESS I ON ANALYSES 
ISROUT FROMUAR FROMJOR FROMSYR 
PROB4B,PROB5B,PROB(jB 
48 48 48 

EACH 

FIXED (4F3.0) 

VARIABLES=ISROUT TO FROMSYR 
REGRESS I ON= ISROUT WITH FROMUAR TO 
REGRESSION* ISROUT WITH FROMUAR TO 



FROMSYR 

FROMSYR 



( 2 )/ 

( 1 ) 



Fig. 13. Program deck for SPSS regression analysis. PROCESS SBFILES EACH indicates 
that each subfile is to be considered separately and in order. Each subfile consists of 48 
cases (months); subfile 4B compares all events; subfile 5B calculates for all "cooperative" 
events; while subfile 6B calculates for ''conflict" events. The first line of the REGRESSION 
statement requests multiple regression; the second, stepwise regression. 



done with intakes from ISR from each of the three countries as independent variables, and ISR 
output as dependent variable. This analysis was performed for three types of event/interaction: 
all events; "cooperative" events; and "conflict" events. For illustrative purposes, only the 
analysis for cooperative events will be shown here. The SPSS program for these regression 
analyses is given in Fig. 13. The output from this program, multiple stepwise regression calcu- 
lations for cooperative events in shown in Fig. 14. In addition to these SPSS calculations, 
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Fig. 14 . Stepwise multiple regression calculations by SPSS program comparing ISR output 
to ISR intake from UAR, JOR, and SYR for cooperative events. 



similar calculations were carried out using the BMD03R program (Multiple regression with case 
combinations). Output from this program includes (1) Sums and sums of squares, (2) Regression 
coefficients with standard errors and t-values, and deviation about regression with degrees of 
freedom and F-values, (3) Partial correlation coefficients, and (4) Intercept value. Also, for 
each selection and subproblem, a table of residuals can be obtained. Illustrated in Fig. 15 is the 
output from the BI-MED multiple regression calculations for cooperative event/interactions data. 
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Fig. 15. Multiple regression calculations output by 6MD03R program, comparing ISR 
output to ISR intakes from UAR, JOR, and SYR for cooperative event/interactions 
in the period 1966 - 1969 . 



Information Retrieval from both the Analytic and Descriptive Data Sets 

For this example, we wish to select and print out abstracts from the Descriptive 
file relating to records in the Analytic file where the ACTOR was USA, USR, UAR, or ISR, 
and the time was June 1967. What has to be done in effect is to search the Analytic file 
until a record is found satisfying these conditions, then run through the Descriptive file until 
the corresponding record is found. Then data from fields in both files will be printed out. 

The file search logic portion oHhe GIS procedure to select and print out'the required 
abstracts is 9iven in Fig.16. The QUERY- statement indicates the two files to be searched. 
The Analytic file (WEISAGIS) is searched to locate a record meeting the required conditions. 
The numbers in the ACTOR EQ statement are the code numbers corresponding to the four 
countries: 002=USA, 365=USR, 651=UAR,and 666=ISR. The READ WEIS DGIS state- 
ment indicates that the descriptive file is to be read until a record IDNUMBER in that file is 
found which corresponds to that previously located in the Analytic file (YEAR and SERIALNO 
of the Analytic file concatenated). When such a record is found, the subordinate segments con- 
taining the lines of abstract or description are to be read until the subordinate (SUB) segments 
are exhausted (EXHAUST SUB). Since only one such Descriptive record is to be read for each 
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Analytic record, the 
program drops down to 
the next statement, 
EXHAUST WEISDGIS: 
RECORD OR 1. Control 
is then given to the 
next statement, which 
continues the search of 
the Analytic file until 
all qualifying records 
have been LOCATE'd. 
The statements omitted 
in the partial program 
shown in Fig. 16 serve 
to control the format and 
location of data from the 
fields of both files, to 
control lines per page, 
and to qualify the repe- 
tition of a field value. For example, one would not wish the value of the field DAY to print out 
for every record; rather , it is desirable to print out the value only when the value changes. 

This is readily done by the qualifying statement: DAY ON DAY CH. This prints DAY only when 
the value of DAY CHanges. A sample page output from this search is shown in Fig. 17. 



Fig. 16. Logic search portion of program to search both Analytic 
and Descriptive files for records of event/interactions during June 
1967 where the ACTOR was USA,USR,UAR or ISR. 



OAV SERIAL NO ACTOR TARGET 



CHRONOLOGY FOR JUNE 1947 

DESCRIPTION 



09 2519 UAR US* NEW YORK TCNCS FIELD ERRTV 

NON-NlLl f ARY OEMUNSTRAT 10N» WALK- OUT ON 
UAR demonstrators TRIEO THREE TINES TO STORN USR EMBASSY IN UA 



2522 UAR ISR 



MEN YORK TIMES ARAB" I SR A EL I 1947 WAR 

SUSPEND NEGATIVE SANCTIONS* TRUCE 
UAR ANNOUNCES WAR W(TH ISR ENOEO 



2*21 ISR UAR 



NEW YORK TIMES ARAB-ISAAEL1 1967 WAR 

SUSPEND NEGATIVE SANCTIONS* TRUCE 
UAR ANNOUNCES WAR WITH ISR ENOEO 



2525 ISR USA NEW YORK TIMES ARAB- ISRAEL I 1467 WAR 

CHARGE* ’.RITICWE* BLAME* DISAPPROVE 
ISR PAM SAID USA PRS HAO PROMISEO GREAT THINGS BUT IN END ISR 
STOOD VIRTUALLY ALONE AGAINST ARAB WORLD 



Fig. 17 . Abstracts indicating event/interaction chronology for June 1967 for the Actor 
countries USA, USR, UAR, and ISR. Decode values for SOURCE, EVENT, and ARENA shown. 
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It is often cf interest to be able to visually or graphically show frequency distributions 
of event/interactions by specific countries over a limited time period. As an example, plots were 
made of acts by the countries USA, USR, UAR, and ISR for the 30-day period, June 1967. 
The required data file was constructed by GIS using appropriate search logic of the Analytic 
file. The total number of acts found was 319; it might be of interest to record weekly totals: 





FIRST WEEK 


SECOND WK 


THIRD WK 


FOURTH WK 


USA 


29 


19 


23 


30 


ISR 


15 


34 


9 


19 


UAR 


29 


16 


6 


18 


USR 


17 


13 


21 


21 



Table 2. Weekly totals for USA, ISR, UAR, and USR as ACTORS during the 
month of June 1967 . Total acts (event/interactions) for month is 319. 




MU-MOO 



FF - THE FREQUENCY OF ACTS BY I S R IN JUNE *1967 
LL - THE LEAST SOUflRES TREND Y=fH0X. WHERE 
3. 39080 

B* -0.05317 

Fig. 18. Graphical display of daily event/interactions by ISR for June 1967 . Plots 
done on a Series 2000 High Speed Incremental Plotter (Graphic Systems Division of Com- 
puter Industries, Inc.), showing least mean square line coefficients. 
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The program written to perform the plotting operations and to calculate the least mean square 
constant and line coefficients was in FORTRAN . The main program performs the plot 
dimensioning and reads the data set containing the frequency information; a subroutine calculates 
the constant and x-coefficient for the least mean squares line, y = a + bx; The calculated 
values of a and b are returned to the main program. The graphical output for this combined 
data retrieval and presentation is shown in Fig.18. 



Conclusions 



The practicality of retrieving and presenting complex interactions in the area of 
political science has been demonstrated with the use of several data management and statistical 
programming packages. Extensions to more sophisticated analyses is possible and may be 
readily implemented. 
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SYNOPSIS 



This paper suggests a documentation indexing system based on 
Jforphologlcal Analysis coupled with eelected Perimeters (MAP) • 
This indexing, besed on functions, boundary conditions, sod 
constraints, permits highly structured retrieval. This approach 
uniquely classifies documents Into discrete functional paths - 
permitting retrieval Independent of discipline. 
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Introduction 

Today's rapid technological changes are producing new information and new 
combinations of Information crosslinking all scientific disciplines. 

This Information is highly structured. Most existing Indexing and coding 
systems exhibit low retrieval efficiency with highly structured Inform- 
ation. Fcrhaps one exception Is the chemical field where the chemical 
formulas serve to structure the Information quite well. 

For many years numerous renmmed Information scientists have suggested many 
approaches to this problem of structuring Information, such a si colon class- 
ification, faceted systems* etc. Today, more than ever, these suggestions 
are relevant. New applications for technical Information, especially In 
technical forecasting, advanced design and analysis and economic projections 
foreshadow a tremendous upsurge for structured technical Information, 
coupled with constraints, conditions and economics. 

Economic constraints and cost-benefit analysis are having a tremendous Impact 
on selected technical situations. No longer Is the first solution the most 
Important. It Is the best solution that counts. It Is most important to 
have a coding system that extracts and presents this essential data from our 
reports. The pros and cons of such a system are examined In this report. 

Past Experience 

As a result of my practical experience with technical information center 
users at various government, university and Industry levels, my views on the 
effectiveness of present-day coding systems have crystallized. For example, 
some of the general types of Information sought were: 

1, What Is the state of the art of Function X ? 

2, What devices are available for performing Function Y ? 

3, What are the specifications, operating conditions 

and cost for System Z ? 

The search for Information - some highly .structured - was usually carried 
out by professional Information specialists using existing techniques back- 
ed by computer support. 

Answers were supplied In the conventional mAnncr; once In a while, however, 
a cross-discipline solution or an alternative path would be suggested which 
led to some unique solutions based on the function performed. 

Functional Approach 



Most information requests centered about specific functions within a 
given discipline such as: detection, amplification, jolnlrg, mixing, 
analysing, etc. Constraints regarding the function, such as coat per 
function, operational and environmental criteria were desired but extremely 
hard to retrieve. Figure 1 shows a typical matrix concerning 
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performance of a function. In referring to function, constraint and boun- 
dary conditions, we mean 

Function - what is going to be performed ? 

What will be the cost and what are the 
limits of operation and environment ? 

If available information could be coded in this manner, its usefulness as 
well as its accessibility could be Increased. Let us analyse some of today ' 8 
coding systems. 

The shortcomings of most of today's indexing and classification systems 
lie in their inability to provide functional , highly descriptive data to 
the users. The purpose of the classification system is to group similar 
items together so that a suitable coding system may be devised for refe- 
rencing these items. 

In the UDC system, for example, automation equi p ment would be classed 
under a variety oi headings: 



620.1 


Material Testing 


621. 1/9 


Machining Details 


621.3 


Elecirical Engineering 


621.5 


Pneumatic Energy 


621.6 


Fluid Distribution 


621.8 


Power Transmission, material handling, mecha- 
nical fixing, attachment, lubrication 


621.9 


Tools, machine tools , machinery 


It has been said^ that 


each of these headings divide into 50 - 250 


subheadinas. UDC is too aeneral to be applied to automation equipment. 
Even with the sophisticated SDI Selective Dissemination of Information 
systea, using descriptors, there are shortcomings in that all fields have 
to be coded to pick up the same function in any one discipline. 



Discipline Crossover 

Another factor to be considered is the impact of discipline crossover. 

In the beginning the sciences were broken down into disciplines for 
each field of scudy. Today these disciplines interact with each other in 
various combinations, forming new disciplines, such as bioengineering, 
magneto-hydro-dynamics , HKD, etc. This implies changes of search strategy 
to find complete information in these emerging disciplines. 

Another aspect of classification includes scale or level. We have in 
mind the following levels - complete systems, subsystems, components, 
parts or elements, materials, substances, atomic or subatomic levels. 

1) A Classification of Standard Devices for Mechanisation/Automation of 
Production Processes - W.T. Moodle, R.J. Sury 

THE PRODUCTION ENGINEER, June 1969 



687 



32.3 



W 



\ 

1 



Morphologioal Analysis with Parameters 



Co— uni cat ion of Technology 

If one aaka oneself why we publish so much technical information, one may 
answer it is to co— unicat e scientific results so that we may advance 
society as a whole. However, in order to accomplish this, we need to re- 
trieve the data, interpret and understand it and apply it effectively* 

This new knowledge should be made universally available, regardless of 
which discipline developed it. One method of achieving this goal is to 
employ functioi^l coding systems with discipline levels and boundary 
conditions identified, utilizing morphological methods for deep indexing 
of our technology* 

Morphological Method 

The term Morphological Analysis was coined by Dr. Fritz Zwlcky. He sugges- 
ted that, if one studies the features of any object (material or system), 
its features can be described by a hypothetical Morphological Box. This 
box would be composed of pigeonholes containing each feature. Zwlcky 
pointed out that this ordered analysis permitted full characterization of 
known objects as well as characteristics of objects yet to be invented. 
From this set of characteristics it is possible by permutation and combi- 
nations to work out a complete system description. It is also my thesis 
that this indexing system, coupled with boundary conditions, can fully 
and accurately index our technical information. 

Another aspect of the Morphological Method is that certain paths (solu- 
tions) can be eliminated from retrieval by imposing laws of physics and 
boundary conditions. Further refinements and selectivity can readily be 
achieved , 

If we take a problem that has many potential solutions, we usually analyse 
the problem by breaking it down into its basic functions and generate 
alternatives with parameters. Then, in finding the solution, we limit the 
conditions and select the answers based on optimized parameters. 

In the course of establishing the solution "matrix" we can arrive at all 
potential solutions to a problem. This matrix becomes an indexing tool in 
that it pinpoints discrete solution paths. The matrix can be used to index 
technical articles describing a particular solution to the problem. Figure 
2 shows the path from problem to solution via the functional parametric 
route: 



fftOtltM 




Figure 2 
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In order to Illustrate the operation of the Morphological Analysis Para- 
meter Information Coding scheme, we should examine a few typical systems. 
Zwlcky's method concerns the totality of all the solutions of a given pro- 
blem. For example, If we were to examine possible propulsion power plants 
and If only. one specific device, method or system Is asked for, this me- 
thod inmediately generalizes the Inquiry to all possible devices, methods 
or systems which provide the answer to a more generalized request. We can 
also start out with the functions and parameters to establish the matrix. 
Each parameter Pi will possess a member of Ki Independent Irreducible 
values p , p ... p.. If one element is encircled in each functional ma- 
trix and all the circles connected, each resulting chain (isomorph) repre 
s^rnts one possible solution of the original problem - this scheme of ma- 
trices can be used to construct an N-dlmenslonal morphological box. The 
morphological method is an orderly way of looking at things and provides 
a systematic perspective over all possible solutions of a given problem. 

Figure 3 Illustrates Chemical Propulsion Power Plants: 




* Intrinsic or extrinsic chemically active miss 
= internal or external thrust feneration 
- Intrinsic, extrinsic, and zero threat aepnentation 
= infc.nal, external thrust aufnentatioa 
= positive and nefative jets 

nature of the conversion of the chemical mercy into 
mechanical enerfy 

vKues, air, water, earth 

translatory, rotatory, oscillatory, and no notion 

psems, liquid, solid state of propellant ■ 
continuous and intermittent operation 
setHpitini and not setf-ifnllini propellants 



Zwicky rerrarfca that m this, if no intenml contradiction! were p r es e nt, 
would n i\ke poetebte 

7T*< - 2x2x3x2x2x4x4x4x3x2x2- 34,164 
!■! 



Figure 3 
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If the totality of the paths was considered and no contradiction was 
present, a possibility of 36,864 propulsion engines could be construc- 
ted. However, internal restrictions and physically impossible combina- 
tions reduce the number to 25,344 propulsion engines of which only 30 
types exist today. 

This system describes these engines - both those that exist now and 
those that can exist. In addition to the basic operational parameters 
cited, the specifications and environment factors can be considered, 
as shown in Figure 4. Additionally, the cost factors can bo included 
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A simplified version of the morphological analysis dealing with the 
electric clothes dryer was presented by Alger and Hayes, as shown in 
Figure 5 below: 




Figure 5 
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2 ) 

A very detailed application of the Morphology Method was made on the OSCAT 
project dealing with the best processing route from an ironbearing feed- 
stock (ore) to a shaped pure iron product. 

The metallurgical operation of extracting metal from ore involves three 
functions : 

(1) Separation - extracting and purifying the metal values from the 

gengue 

(2) Reduction - reducing the metal compound to metal 

(3) Shaping - forming the metal into useful shapes 

One set of conditions is wet or dry processes: 

Wet - iron is always liquid or solid 

Dry - Vaporized as a volatile compound 

(Only wet processes were considered). 



A comprehensive list of separation methods is shown in Table 1. 

Figure 6 shows the complexity of the problem and the network of alterna- 
tives . 



(FfFOSTOCK) 

Comminution N~ ^PralraitroinS 



HWogtntlion 
CwbonyUtkm 
DMftrion 
SuMlmitfon 
AdK»p*!on 




Gravity 



Inching 

OytUtkitfon 
Proa'pHttion 
Filtration 
Evaporation 
Sohraot Attraction 
Ion tadiing* 
Chromitograghy 
Ojddition 
alteration 
Absorption 
Biologic*! Inching 
Swelling 

Orgmomstoltici 
Armlgimitian 
Zoos milting 



RE0UCV1ON (METAL) ELECTROLYSIS 



Figure 6. 

2) A. V. Bridgewater M.Sc. Theais, University of Astor in Birmingham, 1966. 
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(Absorption 




G-L 






Adsorption 




G-S, 


L-S 




Amalgamation 




L-S 






Biological leaching 




L-S 






(Carbonydatlon 




G-L, 


G-S) 




Centrifugation 


M 


G-L, 


G-S, 


L-L, 


Chromotography 




G-G, 


L-L 




Clathratlon 




L-S 






(Condensation 




G-L, 


G-S) 




Crystallization 




L-S 






Cyclone 


M 


G-L, 


G-S, 


L-L, 


Dialysis 




L-L 






(Diffusion 




G-G) 






Distillation 




G-L 






Electrodialysis 




L-S 






Electrolysis aqueous 




L-S 






Electrolysis fused 




L-L, 


L-S 




Electrophoroals 




L-S 






Electrostatic separation 


M 


G-L, 


G-S, 


L-S, 


Evaporation 




G-L 






Filtration 


M 


G-S, 


L-S 


f 


Flotation 




L-S 




I 


Fluidization 


M 


G-S, 


L-S 




Foam 




G-L 






Gravity 


M 


G-L, 


G-S, 


L-L, 


(Halogenatlon (non-aqueous) 




G-G, 


G-L, 


G-S, 


(Ion beam 




G-G) 






Ion exchange 




L-L, 


L-S 




Ion exclualon 




L-S 






Leaching 




L-S 






Magnetic separation 


M 


S-S 






Organometalllc 




G-L, 


L-L, 


L-S 


Oamoals 




L-S 






Oxidation 




G-G, 


G-L, 


G-S, 


Precipitation chemical 




G-S, 


L 




Precipitation electrodeposition 


L-S 






Precipitation electrostatic 




G-S, 


L-S 




Production (chemical) 




G-G, 


G-L, 


G-S, 


Solvent extraction 




L-L 






(Sublimation 




G-S) 






Sweeting 




L-S 






Thermal decompoaitlon 




G-S, 


L-S, 


S-S 


(Volatilization 




G-G, 


G-L, 


G-S) 


Zone melting 




L-S 
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Soae processes are Inherent elds (M) to other processes. Thus, the first 
stage reduces down to 27 possible separation methods. The complete treat- 
ment was broken down into 6 stages - complete permutations Indicated: 



27 Routes of 
729 M " 

19,683 " " 

531,441 M M 

14,384,907 " " 

387,420,489 " " 



Stage 1 
Stage 2 
Stage 3 
Stage 4 
Stage 5 
Stage 6 



Boundary -limitation: 1. Compatibility 

2. Economics 



What are the implications? First of all, over 387 million classes of pa- 
pers could be written about ore reduction to final form, each describing 
some particular route and each presenting an index point. If the boundary 
conditions are defined, wet or dry process, starting state of metal, etc., 
this would provide a reduction of routes. Furthermore, if costs-benef its 
are given per route, a tremendous fallout occurs, reducing the possible 
routes still further. Figure 7 shows the convergence of routes as a function 
of applying boundary conditions. 
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Each of these applications cite exact indexing points for complete retrieval 
based on functional needs. The system provides for indexing of reports based 
on theoretical system or systems yet to be invented. Precise retrieval is 
possible, using this technique * 

Coding System 

One system proposed for coding the matrix is by functional parameters and 
their variables. 

PROPULSION (CHEMICAL) 

* P u 21 21112 1111 

P^ indicating 11 parameters 
and 21 . . . 2 » 2nd variable of parameter 1 

1 « K x 1st " " " 2 

The position of the number in the chain indicates the parameter. This code 
can easily be handled by existing computer techniques. 



Doundary Conditions 



In addition to coding the functions and subfunctions, and their parameter 
variations, it is necessary to code the boundary conditions. Some suggested 
factors include: 



A. Environment 



1. Temperature 

2. Humidity 

3. Pressure 



B. Dynamic Properties 



1. Load 

2. Strain 

3. Stress 

4. Strength 



C. Mechanical & Physical Properties 

[ 

i 

i 



1. Density 

2. Weight 

3. Elasticity 

4. Size 



| D. Theoretical Concepts 1. Fatigue 

j 2. Safety Factor 

i 

! E* Economic 1. Cost Factors 

! 2. Initial Cost 

3. Operating Cost 



These factors add a tremendous depth of value to the indexed technical re- 
port. It allows one to determine the limits of a system, the cost per function 
and to compare systems favorably with one another, independent of discipline. 
This boundary condition coding also provides more specif ivity to the retrie- 
val. In other words, given a set of boundary conditions, one could ask for all 
solutions with emphasis on lowest cost. 
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Cogglgtt Coding 

In addition to MAP codes, there would be the boundary conditions codes for 
each laomorph. These could be coded simply by letter, number and range 
such as: 

A - 1 - 50* C 

2 - 50 % 

3 - 100 PSI 

B - 1 - 100,000 8 

2 - 150,000 

3 - 160,000 

4 - etc. 



Constraints on Authors 

One of the negative factors of structural indexing is the constraint placed 
on the authors in determining all these factors. Yet, on the other hand, this 
deep indexing would force the the author to convey more information about 
this subject, and would allow users to understand and analyse the report 
more fully. 



Rewards 

If all documentation were coded in the same way, the users could readily 
determine the effect of changes (isomorpha) and determine the results, in 
effect, using technical information as a system model. 

The overall benefits would result in better communication of technology, 
particulary across discipline boundaries. This would be especially true at 
the device level where many devices perform the same basic functions, some 
better than others, for example, the machining of metals can now be done: 

mechanically 

acoustically 

chemically 

thermally 

optically etc. 

The approach taken depends on the operating and boundary conditions. 

Suggested Development of MAP ICS 

As mentioned at the beginning of this paper, MAPICS is only one approach 
to structural indexing. However, for this approach as well as the others, 
certain development measures must be taken: 

1. A functional thesaurus muat be developed. 

2. The boundary condition factors must be tabulated. 

3. The system must be widely tested. 
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Potential Benefits 



If the system in its present or modified form proves useful for structural, 
indexing, then certain benefits would be possible, such as: 

1. Precise in-depth retrieval 

2. Functional retrieval independent of discipline 

3* Select retrieval based on boundary condition limits 

4. The ability to see "gaps in technology" 

5. The ability to cross-fertilize disciplines by "gap filling" analogs 

6. The ability to organize the development of a "function" 

7. The ability to "predict" or "forecast" new technology based on possible 

next steps 

8. The ability to find wider applications for developed technology in 
different disciplines 

9. The aid to authors in preparing their papers for detailed coverage 

10. The ability to' "see" small change effects on a total system 
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THE NATIONAL SERIALS PILOT PROJECT 
Til lie Krieger 



SYNOPSIS 

The Serials Pilot Project attempted to work out the 
details in setting up a serials data base. Many of the 
problems encountered resulted from variations in cata- 
loging practice and rules* Some of the solutions are 
suggested, together with a discussion of some of the 
basic problems which have yet to be resolved* 



Serial literature is one of the most important parts of a 
library* s collection and is indispensable to research in all areas 
of knowledge* Control of this type of publication has long been 
of concern to libraries and to scholarly and research communities* 

It is difficult to control because its date elements are mobile, 
often independent of each other and require constant up-dating* A 
serial is in a fluid state; it changes to meet the needs of its 
audience, the opinions of its editors and the desires of its pub- 
lishers* Each issue must be accounted for and estimates of the 
number of serial titles currently being published range upward from 
250,000* Titles are often difficult to identify, describe and locate* 



Background of the Project 

The National Serials Pilot Project was undertaken to determine 
the feasibility of establishing a National Serials Data Program* 

The requirement for this system had been established by the 
Information Dynamics Corporation study in 1965* 

The three national libraries, i*e* Library of Congress (LC), 
National Agriculture Library (NAL), and National Library of Medicine 
(NLM), with the cooperation of the National Science Foundation, 
the Council on Library Resources and the Joint Committee on the 
Union List of Serials Joined to implement the first phase of the 
National Serials Data Program* 
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IX Prooeeoing for Automation - I 

Work on Phase I of the program was conducted by LC and resulted 
in the MARC format for serials. Work on Phase II was to be more 
detailed and procedural than that done In Phase I, which was largely 
exploratory. Each of the national libraries had its own definition 
of a serial and their practices of handling serials differed. A 
decision was made to continue with Phase II outside the confines- of 
a particular library in order to be free of the policies and proce- 
dures of a single institution. 

Initial funds were furnished by NAL. The Association of Research 
Libraries agreed to act as Executive Agent for the three libraries 
and the National Libraries Task Force on Automation and Other Co- 
operative Services provided policy guidance. An advisory committee 
of librarians from across the country was appointed to advise the 
National Serials Pilot Project. 

Mr. Donald W. Johnson was engaged as the director in September 
1969 and Phase II of the National Serials Data Program began. 



Objectives and Goals 

The objectives stated were: 

1. To create a machine-readable file containing bibliographic 
data on live serials in the fields of science and technology. 

2. To produce a number of preliminary listings: a union list, 

a list of titles held by the national libraries and a list of titles 
not held by any of them. 

3. To produce written reports detailing the findings of the 
project: problems encountered, solutions attempted with the results, 
and recommendations for future activities. 

4 . To develop files, techniques and procedures which would 
demonstrate the feasibility of a national serials program and which 
could then form the base for such a system. 

The national libraries envisioned the possibility of using 
compatible data bases for a number of purposes — determining the 
extent of their holdings, identifying gaps, locating items not held 
by one of them as well as showing unnecessary duplication. It could 
act as a basis for allocating resources and responsibilities among 
them. 



Data could be obtained to assist in planning for effective 
utilisation of their serial collections. 

On a broader level it could increase the availability of the 
world's scientific and technical serials both bibliographically and 
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physically. It could aid in the development of more effective and 
efficient library networks and information systems* It might simplify 
and expedite the processing of the literature and it would facilitate 
the planning t evaluation, and direction of programs and projects 
involving all of serial literature. 

Among the users of the Program would be the national libraries, 
all other interested libraries, individual users, publishers and 
distributors of serials, and abstracting and indexing services. Such 
a central store would allow the sharing of the results of intellectual 
activity and minimize costs incurred in such a system. 



The Base and Format Decisions are Agreed Upon 

The Library of Congress agreed to re-write their MARC internal 
processing programs in COBOL F, to provide source decks and also to 
provide temporary disc storage for Project data. The Project 
utilized remote-access typewriter terminals (ATS) which were linked 
by telephone over ordinary commercial lines to the 360/40 at 1/Z in 
an on-line mode. 

Each week the input data were to be read onto a magnetic tape 
which was then to be processed on the NLM 360/50 in a one-batch- 
per-week mode. For practical purposes then, the system was batch 
mode • '> 

The Union List of Scientific Serials in Canadian Libraries , 
third edition, was selected as the source or base file. It was on 
tape and available to the Project. It was a working system which 
had gone through several editions. The form of entry was established 
according to the cataloging of a single library, and thus, had a 
degree of uniformity. 

An entry, as defined by the Cataloging Code, is the heading 
under which a record of a bibliographic entity is represented in a 
catalog or list, generally the author or title of the work. 

The Canadian list had to be reformatted to the ATS output 
format and from that stage converted to the MARC serials format. At 
the time it was believed that the Canadian list could easily be re- 
formatted. This turned out to be a false assumption. The structure 
of the Canadian file had been devised many years ago and the file 
characteristics were not known. 

The reformatting took seven months to complete and, even then, 
was not completely satisfactory. The MARC format is highly particu- 
larized while the Canadian was less so, e.g. data elements were com- 
bined in the Canadian list which were handled separately with MARC. 

In a reformat program, one must be able to identify data elements in 
the file toeing reformatted and equate them with data elements in the 
file to be created. In this instant there was not a one-to-one 
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correlation between elements in the two formats. It was not possible 
to break out the elements by computer. Going from the general to the 
particular was not possible and each record required additional edit- 
ing. In any event as time went on the reformatted file could not be 
made accessible to the Project through its terminals because it was 
on the NLM computer, to which the terminals were not connected. 

The Project was the first group to make use of the MARC serials 
format and since they were working in cooperation with LC, became 
an effective field test for it. 

The major decisions as to format and base were resolved in the 
fall of 1969. In February of 1970 two librarians were added to the 
staff, which until then had included only the director and his sec- 
retary. It was time to begin developing procedures and testing 
theories • 



Work Begins and Basic Problems Assert Themselves 

While the reformatting of the base file was underway, another 
list of titles which had been reformatted to the ATS output format 
was made available. This was a list of serials indexed in Index 
Medicus , a total of 2243 medical titles. The staff compared this 
printout against the printed volume of the Canadian list# All titles 
which were found would ultimately be entered into the Master file 
via work on the Canadian list. Titles not on the list were to be 
entered on a custom basis. 

Approximately 275 titles were thus segregated and the process of 
bibliographic verification, or searching began. It was necessary to 
search for and verify the various data elements for content and form. 
This would lead to each bibliographic entity being described in a 
unique way which would be identified in the same way on all lists 
and in all files. Since the 275 titles were specific to medicine 
it was not surprising to find that LC had records for only about one- 
sixth of the titles. Once searching in two libraries had begun, 
problems immediately arose. 

The Task Force at this time directed that LC files were to be 
considered prime authority and that all searching was to originate 
there. The choice for entry, as well as the form of the entry were 
to be taken from LC files. Only if the title were not found in those 
files were other sources to be used. It was hoped that this proce- 
dure would result in an authoritative serials data base. 



A Basic Problem: Entry 

Since LC was considered to be prime authority and knowing that 
they had the largest collection, it was assumed that the number and 
degree of inconsistencies might be reduced. Other sources generally 
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followed the examples set forth by L C. 

Catalog rules, or the manner in which a bibliographic entity is 
identified, were formulated for the purpose of making the content of 
the collection available to readers. It serves its purpose best if 
done according to an established system. 

Various access points leading to material are the author (s), and 
title (s). These are the entries. When one speaks of an entry there 
are really two facets to the term and each should be considered 
separately. ' ' 

The first deals with the manner in which the entry is represen- 
ted, which rules to follow so that a consistent approach may be 
followed and a manner of representation stylized. The second facet 
concerns itself with the ordering of these entries so that the main, 
or primary entry may be identified and given special status. 

A "main" entry is the fundamental or primary heading under.which 
a record is represented in a catalog. The concept has meaniryg*when 
one wishes to make a single entry listing, where each item d0r be 
represented only once. It has meaning when one wishes to produce a 
catalog card. The concept loses that meaning in a machine file where 
a list can be specified in any order from a single input. Yet the 
rules which govern the basic records from which the machine file is 
built cannot be ignored. 

Throughout the history of cataloging the method of determining 
the main entry has changed. When a change in the main entry occurs, 
be it author or title, the work has been re-cataloged. A change in 
a secondary or added entry might involve up-dating, an easier and 
less costly procedure than re-cataloging. 

Since the Project used the results and decisions of cataloging 
procedures to enter the material into the machine file, conflicts 
developed over the need to re-catalog which meant a new record, or 
the need to up-date. This leads to the subject of successive vs. 
consolidated entries, about which more will be said later. 

The problem regarding the form or representation of the entry 
has not been consistently followed in each of the libraries. In 
1967 the Anglo-American Cataloging Code was issued and shortly there- 
after adopted. This involved a new approach and led to a different 
type of representation. An example: 

old rules-- American Library Association. Public Library 

Association. Armed Forces Librarians Sc cc 4 rn. 
Military Library Standards Committee. 

nut under 

new rules — Public Library Association. Military Library 
Standards Committee. 
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This is only one of the variations possible when working with the new 
rules as opposed to the old rules. Yet the catalogs in all libraries 
show the effects of various rules and rule changes over many years. 

The decision to accept LC also meant accepting their policy of 
H superimposition tt . This means that if the library has already estab- 
lished a form for an author, personal or corporate, it will continue 
to use that form. It will continue to use that form even for new 
works produced by that body. Should the body change its name, which 
would involve cataloging a new work, then the newly done work would 
be represented according to the latest rules. All names new to the 
library # s collection would be formulated according to the latest 
rules while previously established names would continue under older 
forms. 



This results in having at least two sets of rules in a single 
data base. The only way of reconciling the difference is to develop 
an authority file in the new serials data base. 

An authority file would establish the form used, variations to 
that form would be entered into the file with a guide to the form 
selected: 



I.S.L.I.C. 

see 

Israel Society of Special libraries and Information Centres 

An authority file enlarges the possibility of locating the desired 
material regardless of the approach of the user and does not require 
that he know the rules being employed at any time. 

An authority file built at the same time as the master file of 
serials would permit the most consistent file, as well as one which 
can be drawn from as it is being built. It cannot be built prior to 
the file upon which it is based since there is no way of judging 
what will be needed. It can be built later, but this would involve 
looking at records a second time, a more costly procedure. Errors 
which have been made in the absence of such a file might never be 
detected. 

If there is not time, or money, to build a complete file with 
its necessary adjuncts now, how will there be time to go back and 
build what is needed and correct the errors made in the meantime? 

The HARC serials format attempts to work with the identifying 
elements of the particular work. It does not include an authority 
file: it does not preclude the use of one. 

Very early in the life of the Project, Mr. Johnson recognized 
that an authority file would have to be built. He suggested at that 
time that each record carry its own authority. Once searching began 
it became apparent that this would lead to an extremely large file 
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and net be aa efficient as one would like. 

The Pilot Project was charged with the responsibility of listing 
mterial as they found it. They had neither the means, staff, nor 
the authority to change or up-date what they found. They were to 
present the problems to the Task Force. 



Resolution of Other Problems 

The manner in which other data elements were found presented 
inconsistencies for which the Project developed procedures. These 
included: 



Modified letters (diacritical marks) 
Capitalization 

Language of Place and Volume designations 
Holdings information 



Modified Letters 

Some of the sources checked contained diacritical marks, others 
did not. Some sources had a variable approach. The staff did not 
always recognize when the marks had been eliminated and the file 
being produced has at least 35 different languages represented. The 
Project people discussed the variations found and the fact that the 
Project was employing three machines, each of which had a different 
character set. They reasoned that the marks originated as sounding 
devices and questioned the need for them. Checking for each language 
would have involved much time and cost. They concluded that elimi- 
nation of all modifications would be the most useful, since it would 
be consistent. 



Capitalization 

The Task Force requested that the output be in upper and lower 
case. The sources of bibliographic data are inconsistent but no 
rules for capitalization have been developed which the Project could 
employ. This led to the acceptance and transmission of data as 
they were found. 



Language of Place and Volume Designations 

Place and volume designations varied. At times they were in the 
vernacular, or the language of the Mterial; and at times in English 
regardless of the language of the piece. It was not always possible 
to locate the form in each of the many languages The English form 
was easily determined. The serials were not readily accessible and 
one could not assume what they carried. The decision was made to 

■ i 
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enter uniformly place and volume designation in the English form* 



Holdings Information 

An important objective of the Project was the inclusion of . 
holdings. When holdings were easily available, such as on the Index 
Medicus list which included detailed holdings for each title, they 
were included. Checklists were sent to the libraries to indicate 
what titles were contained in the Project Master Pile, and such 
holdings as were provided were entered. 

At time the holdings were entered in detail and at times in a 
code which indicated the fullness of the holdings. By means of the 
code developed by the Project it would be possible to know if the 
holdings were unknown, complete or substantially so, incomplete, held 
for a limited time, or if received but not held. 

Entering holdings on a detailed or specific level would require 
constant up-dating and a continuous inventory on the part of the mem- 
ber libraries. It would not be possible for all of the participating 
libraries to up-date in a constant manner, nor would it be possible 
for the system to enter such data. The increase in file space would 
be prohibitive even if only a few libraries were to be represented. 

It might, however, be possible perhaps for the local members of 
a network to carry holdings in a more detailed fashion while the 
national data bank listed only the code for a library indicating it 
held a particular title. 



Successive vs. Consolidated Entry Approach 

A consolidated approach to the cataloging of serials means that 
the serial is cataloged under the latest title with the complete 
history of the serial included in the notes. This approach is also 
referred to as the "entity" approach. 

Successive entry cataloging doss not involve re-cataloging each 
time a serial changes its name. Each distinctive title is cataloged 
with a note made to indicate the immediate predecessor and successor. 
More items are cataloged, but fewer and less complex notes are 
required. Also, fresh cataloging is simpler than re-cataloging as 
would be the case with a consolidated approach. 

The MAKC format for serials can handle either approach equally 
well. Looking forward to the probability that a Standard Serial 
Number (SSN) would be affixed to each distinctive title, the Project 
took the successive entry approach. Each title was entered as a 
separate record with machine links established so that the entire 
history of a serial could be represented as a bibliographic chain. 
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Standard Serial Numbering Experiment 

The Project was not- charged with the responsibility for working 
out details for an SSN proposal. But a complete serials data system 
would include an SSN. The format allows a tag for the use of one 
which will be used when the details have been worked out. The Project 
used a tag designated as a Local System Number. Building a number 
based on the ANSI approved SSN, they used the number to experiment 
with the linkage structure within the format. Links can be made to 
preceding, succeeding and all associated records. 



Up-dating 

Due to the temporary nature of the Pilot Project, no attempt 
was made to organize an up-dating procedure. Any continuing project 
would have to make provision for this aspect of maintaining the 
files. As a network comes into being and develops the up-dating 
procedures would be developed and strengthened. The Project could 
not undertake this area of development. 



Procedures of the Project 

The period from Septeaiber 1969 to November 1970 was largely 
spent in learning the nature of the materials with which they 
worked and in developing procedures. 

The procedures used to gather data were, in the main, simple. 

The bibliographers began by using the Canadian list as the base, and 
choosing a live title, would examine the various records in the files 
at LC. They would photocopy the catalog card when one was available, 
or copy the information they found. When necessary, they would seek 
additional information in other files. 

The editing procedures which fol lowed, entailed indicating the 
tags, indicators, delimiters, subfield codes and filling out the 
fixed field data required by MARC. All Information was entmred 
according to the MARC serials format and the MARC editing manual, 
with certain additions made by the Project staff. 

The ATS operator would then enter the data into storage. The 
work would be revised and corrected and held. Once a week all of 
the work done that week would be sent to queue. The data was de- 
queued and transferred to magnetic tape. The tape would be picked 
up at LC and taken to the computer at NU4 for processing. The 
additional material would then be added to the Project Master File. 

Specialised diagnostic listing were then produced, complete 
with error messages. These would be gone over in the office and' 
errors corrected. Procedures for correcting the file have been in- 
cluded in the program package. 



707 



IX Proceeoing for Automation - I 

A programmer joined the staff in September 1970 and the work of 
completing the programs and providing the documentation went forward 
rapidly. 

Prom November 1970 until the end of the Project, June 1971 f the 
time was spent in attempting to go into the demons tart ion stage. This 
led to a substantial amount of file building. Two editors and an ATS 
operator were added to the staff. A second port to the LC computer 
was made available. 

The 275 titles from Index Medicus resulted in a file of over 
400 records as associative titles were found and included. As each 
record was searched all previous and succeeding titles as well as 
supplements were also searched and added to the file. 

It was with these related records that the Project was able to 
establish and test procedures involving the linkages provided by 
MARC. 



Major concepts were stated in the formation of the Pilot Project. 
Working with the details has produced refinement in those expecta- 
tions. The technical feasibility of the initial program has been 
proved. 

Mr. Johnson has just completed his final report, and this 
report, together with the systems flow-charts, documentation and 
procedure manuals provide greater detail than were available at the 
time of this writing. 

Solutions to the basic problems must be arrived at before work 
proceeds. The problem of entry and the consideration of the forma- 
tion of an authority file might well be solved if a small group of 
individuals, faced with the immediacy of the problem, and each with 
a knowledge of serials cataloging and computer technology could be 
formed to work out the details and make the necessary decisions. 

The capability is here, the need is felt, may we now expect the 
decisions? 
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THE COMPUTER AND THE M.I.T. LIBRARY 
Myer M. Kessler 

Assoc. Director, Libraries; Director, Technical Information Program (TIP) 
Massachusetts Institute of Technology, Cambridge, Massachusetts 02139 

SYNOPSIS 

The Computer Applications Division of the M.I.T. Library sup- 
ports a Technical Information Program (TIP) that la engaged 
in the design and operation of computer facilities to serve 
the information needs of the teaching and research cowmity 
on our campus. TIP is a general data and text management 
subsystem which serves as a tool for search, retrieval, edi- 
ting, formatting and general manipulation of organized files 
such as data, bibliographic lists and mixtures of text and 
numeric information. 

In the past three years an information system has been operational at MIT which was 
originally designed with a considerable experimental component in mind. We were not 
constrained by the requirement that it be useful immediately. 

After the period of experimentation and design, the system as of two and one-half 
or threa years ago, was lmbeded in a rather mature computer and communications network 
bn the MIT campus. We were operational entirely on the time share, {remote access system 
known to some of you as the MAC system. It was a central computer with some 250 or 300 
bonsoles distributed largely in offices, laboratories, and dormitories around the MIT 
campus, although there were several, a dozen or so, in people's homes and two or three 
jin other cities* Largely, it was a computer communications network on the MIT campus. 
This was a generalised computer facility. That is, it was not designed or Intended 
primarily for information purposes. It was intended for education, for research, for 
program writing, and computing. 

On this system, we constructed the additional function of information access and 
to begin with ws used as our experimental medium what is a rather alzeable data base 
in the physics .literature. I will not describe it to you except to say that it con- 
sisted of titles, authors and the usual Information needed for access plus soioe rather 
unique features. 

At any rate, considerable prior experimention by us and by some outside judges 
convinced us that one could do a respectable job of on-line retrieval and of an inter- 
active mode available to the users * So the computer environment was a time-shared 
remote access system without a public and private memory sector. The literature was 
well beyond the critical size; that is, five years of physics literature, of some 
40 or 50 journals. The language of Interaction was natural language, fairly convenient 
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for us. It did not require any programming experience and people took it very well. 

The in-put out-put machinery was largely a typewriter console although we had about 
a dozen or so cathorayscopes in various places. 

The MIT community is quite different from many of the customary user communities 
in that it is a University community, rather similar in size and function to the 
Weizraann Institute or the Technion but with important differences. MIT, although it 
is an engineering and science school, is somewhat different, somewhat broader in its 
educational policy. We do have a school of music, for example. We have philosophy 
and literature, economics and so on. 

Second, MIT is essentially an educational institution and everything is directed 
at chat. Even the research that goes on at MIT has a very large educational component 
so that our measure of success had to include student participation in the system. 

The other important factor is the method of payment. From the beginning we deci- 
ded that at least for a while the payment scheme would be that of the partial subsidy; 
the setting of the system, the design, its maintenance and the services of data base 
will not be pro-rated and charged to the customer. The only thing that the user 
would pay for would be to pay for incremental costs of the actual time used in his 
interaction. So it was not free to the user but he did not by any means pay the entire 
cost of the system. 

When this was first presented to the MIT community, we were in a state of great 
euphoria because it looked as if the world was going to beat a path to our door. Lit- 
erally hundreds of people used it, including students, but it turned out - it was 
obvious within three months - that there was a great novelty effect in the thing, and 
that the user population was beginning to thin out and within six months we settled to 
a much smaller incidence of users, which give us some bad moments. But, on the other 
hand, this small curve is beginning to grow slowly. So the on-line response was 
rather satisfactory. 

However, it became clear within six months and in fact this has been reinforced 
in the past two years, that in the on-line use of the system what we expected did not 
happen, and some very surprising things did happen in the user behavior pattern. 

We thought that there would be a lot of interactive manipulation of the literature, 
and we all had plans and expectations for on-line interaction and man-machine inter- 
action. I don't know whether that is a local situation and to what extent it will be 
repeated in other places, but this did not happen in our case. What happened wa4 that 
the user very soon discovered that his needs don't change from month to month; that 
after a period of experimentation he can design a fclrly acceptable and satisfactory 
profile, or method of inquiry and that it doesn't change at all. The user flnda him- 
self asking the same questions month after month as new data is loaded. He very soon 
learned that it doesn't pay to sit at the computer. The facilities are available to 
leave a standing order, so to speak, in the computer and every month as the new 
literature is loaded, it is swept through these various profiles and whatever matches 
the profile is merely deposited in the private memory of the particular user, and he 
can go away for six months and find the answers in his private file. 

Intimate Interaction with the literature did not happen and I should say this was 
universal. It was obvious within six months that people will not sit and converse 
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with the literature, and the mode of operation is tending in the direction of SDI, 
although to be sure there is no human interaction. The profiles are designed and 
adjusted and manipulated by the users. 

As this went on, it became clear that it is too expensive to maintain long back- 
logs of literature for retroactive searches because physicists did not use is much, 
and so we removed the bulk of the literature from the storage, from the direct access, 
but we announced to our people that any time they wont to make an retroactive search, 
we will load the tapes and I must report to you that in two years we have not had a 
single request to do that. In chemistry, on the other hand, there is considerable 
call for retroactive information. But the physicists behave as if there is no history. 

How, other things began to appear. Immediately word came to us from the users: 

"look here, every month I am getting a response from the computer. I don't want to be 
receiving these little print-outs. I want to save the results on my own private 
memory and merge the monthly retrieval results as they come along." And this was an 
on-line interaction. We did not do it for the customers. We said, very well, we 
will develop a merge program and you do it yourself. And so the users began to accu- 
mulate a private library of their own consisting of the results of the monthly 
retrievals from the general literature. 

The next request went something like: "very well, we now have results of retrieval 
from the generally available line. However, we go to conferences, we speak to people, 
we get letters, we have some information sources that come to us from other roads, 
from other channels. We want to incorporate those information sources into the re- 
sults of our monthly retrieval or monthly SDI." So we had to develop simplified in- 
put techniques, editing techniques, up-dating, so that now the user could not only 
accumulate the results of his SDI retrieval on the computer but he could also add 
information, documents or even comments that came to him from other sources and in- 
corporate that into his file. 

What you see here is a trend for every user to develop a private library, inde- 
pendent of what it' in the central library. 

The next tendency that we noted has great implications, and that was the use of 
the computer as a communications device rather than as a text manipulating device. 

In the field of physics, for example, there are at MIT at least a dozen groups working 
on various aspects of solid state physics, who said to themselves: "let's get together 
and merge all our accumulated information and get rid of the duplicates and create 
a sort of grand MIT file on solid state physics." This is a communication device be- 
cause in a sense they published a bibliography for the small audience of the MIT solid 
state physicists. 

This is the way the on-line experience grew, rather than through intense, intim?.;: 
interaction with the literature. It grew in the direction of information manipulat. on, 
editing, sorting, adding, merging, rather than in an intensive aearch. 

Other things happened as time went on. It occurred to people that a text hand 1 ?*,* 
program such as we developed for information handling, can be used for other pu"j>c 
And ao a professor of physics, who also happened to be the associate dean of the* . .*ite 

school, said to himself: "if I can have a list of physics articles and retrieve 
want, why can't I al3o use the same program to have a list of graduate student . 
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details - age, name, department and so on, and use the same program for retrieving 
that* Requests came to us to generalize our programs, to handle things other than 
what we would ordinarily call document information because there are many files of 
information that are structurally or formally very similar. For example, the list of 
all the buildings at HIT and all their facilities, how many elevators, how many class- 
rooms and so on* Formally that file is not different from the list of all the arti- 
cles in some field of physics, and this has led to a broader and broader use of the 
program and more and more requests to expand the facilities* For example, the use 
of our retrieval program in non-literature search led to requests that we supply 
some primitive means for numeric analysis, e.g., to get the average of the salaries 
of professors or the average age of students. And so into our system went some minor 
arithmetic capacity* 

The surprise was in the use of the on-line; it did not develop in the direction 
that we expected, the intimate search and manipulation of the literature files, but 
rather in the sorting and arranging and formatting of information, adding new informa- 
tion, merging and so on* This is what saved the system from extinction, because had 
we insisted that the user must sit and interact with' the literature, I have a feeling 
that would not have happened and the system would not have gained acceptance* 

I should add one more expansion of the system, in two directions* Within the 
university, there are many disciplines that do not have national services, for example, 
musicology* We have to provide facilities for day to day file creation, editing, and 
formatting of these smaller enterprises whose need# are nevertheless to be satisfied 
in a university* We have been asked to do text analysis, such as frequency of words, 
word associations and so on, rather than retrieval* What I am describing here is a total 
system, the same set of programs built in such a way that they can be put together 
to perform various functions* This generalization of functions, their broadening, 
contributed to the acceptance of our system* 

We are using the same text management system in the area of library management, 
acquisitions, serial control and so on* Here again there was an expansion of utiliza- 
tion because acquisitions in the library, is really 9015 what business managers call 
purchasing; if you develop a system that handles library acquisition of books, the 
same system can be used to manage the buying of chalk and erasers and pencils and 
papers* 

This is the road we took to broaden the functional utility of the system in order 
to make it economically feasible* 

The system now is operational and it is beginning to look as if its initial goal, 
which was to provide an information retrieval system is at least economically a by- 
product of many other functions, that are really supporting the system financially, 
and we are in a way getting a free ride in the matter of information retrieval* 

We are now in the process of getting on to the third generation computer, so I 
have gone through the agonizing experience of writing programs, developing systems 
and getting them going, three times* The computer Industry and the computer engineers 
keep telling us that the next generation of the computer will be cheaper, faster, 
bigger - especially cheaper* It is obvious to everybody except those people who use 
computers that they are right* For the user of the computer, it takes two years to 
develop r, functioning system on a new computer - the programming, the arrangements, 
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Che management aspect, the introduction to the user population. That is a costly 
enterprise. And although it is clear that in some of the newer computers, given a 
three-year setting up period, the subsequent life will be more comfortable and cheaper 
and faster, the fact is that after three years, they will come and say, we now have a 
new one. We no sooner develop a comfortable environment in a computer and start 
profiting from its capability, then the industry introduces a new set* of computers 
and we have this initial, costly, painful and annoying work all over again. I think 
ve have come to the state where we must say, no, we will not accept new hardware. We 
have a set of problems that are facing us that can very well be solved and observed 
with the old hardware and the problems that I am talking about are educational - I 
don't want to use the word sociological because it's been used a great deal - it is 
the human configuration that has to be built into the computer and that can very 
well be done on the second generation computer* We need not go to the third. We 
are going to live with this computer for some time and we will resist very strongly 
any attempt from the computer engineers - and at HIT they are a sizeable lobby - 
from enticing us to a better, faster, cheaper machine, after this. 

We want to stabilize this aspect of the problem because there are many others 
that have to be looked at. 

Another problem that is very serious is the multiplicity of the data. This has 
been referred to again and again. At MIT, chemistry abstracts, physics abstracts, 
Euratom are all showering us with data bases. There is a great deal of duplication. 
Then a group of three or four professors comes and they say, ''look here, we are 
starting a new program in ecology. Everybody is excited about the quality of life 
and MIT should do something about it. The students are demonstrating, protesting. 

There is money in ecology." What is the data base for ecology? Chemistry, physics, 
engineering, economics, politics, aesthetics, everything. 

How can I respond to this type of need? What has happened is that this has be- 
come a national problem and sure enough there is a group in Raleigh, North Carolina 
that is going through all these various data bases, and making a tape called ecolo- 
gical abstracts. This simply cannot work. What is happening is that we are constantly 
repackaging the same data bases. Terh&ps we have to develop a technology of review 
writing, of collections of papers, and present that to the user population rather 
than the raw data of abstracts and indices. 

The other lesson that we have learned is that although MIT is a sizeable scienti- 
fic community, it is not large enough to support many rather obvious and useful appli- 
cations. The scaling factor is such that we cannot possibly run an economically 
self-sufficient application if we apply ourselves to MIT alone. For example, the 
manipulation of the census tapes which was completed in 1970 is a huge complex of 
data. Sociologists, economists, etc. are very much interested in it, but at MIT 
perhaps 50 people want to use it and the cost of processing it is huge and simply 
cannot be done on a local basis. We must regionalize a great many of our activities. 

In the States we find th/it the choice of the regional group is very sensitive; 
we are experimenting with conglomerates, such as Harvard, MIT and the Boston group 
of universities. There ia a beginning of that. 
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SYNOPSIS 

The Argonne Code Center serves as a central information agency 
and depository for computer programs primarily in the areas of 
nuclear physics, and reactor design, engineering, and operation. 

ACCESS, an acronym for Argonne Code Center Exchange and Storage 
System, identifies the information storage and retrieval system 
designed to automate the Center's operation. Implementation of 
ACCESS is on the Laboratory's IBM 360/75 computer. 

I. The Argonne Code Center 

In 1960, the U.S. Atomic Energy Commission, in response to a proposal by the 
Mathematics and Computation Division of the American Nuclear Society, approved the 
establishment of a central information agency and computer program library at 
Argonne National Laboratory. Membership of the Division, consisting mainly of 
physicists, nuclear engineers, mathematicians, and computer scientists, desired a 
formal center for the collection and transfer of information on computer programs 
of common interest. The U.S. Atomic Energy Commission viewed the Center as a 
means of maximizing its Investment in computer program research and development, 
and at Argonne the facility was established as an extension of the Applications 
Program Library within the Applied Mathematics Division* Today, while the Center 
remains within this Argonne division, it is a separately-funded activity of the 
Reactor Physics Branch of the USAEC Division of Reactor Development and Technology, 
with a European counterpart at the Euratom Research Center in Ispra, Italy, spon- 
sored by the European Nuclear Energy Agency. 

The Center has two products — the computer program abstract and the program 
"package." The abstract describes a computer program, the problem it is Intended 
to solve, the machine hardware required, and the material contained in the program 
package available upon request from the Center* The package consists of the 
material required to effectively transfer the program to another installation, 
perhaps even to a different computer configuration. It Includes tho computer 
source deck, run deck, sample problem, data libraries, auxiliary or adjunct routines, 
and the traditional documentation, such as the computer program report, user's 
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manual, and programmer's manual. A program abstract often describes more than one 
program package since the library collection Includes separate program packages for 
each machine version of a program. The programs in the library are Identified by 
accession number and classified into problem categories according to the type of 
problem they were created to solve. The current collection contains 306 programs 
distributed among 18 problem categories as shown In Table 1. These programs repre- 
sent a total of 432 program packages maintained and distributed by the Center. 



Tha activity of tha Argonne Coda Cantar can ba traatad In thraa parts: infor- 

mation aervlcea, conputar activities, and cooperative efforts. In carrying out thia 
thraa- part program, tha Cantar staff la supported by a network of "cooperative instal- 
lation" rapresentativas at participating USAEC laboratories, interested universities, 
USAKC-contrecting institutions, and industrial organisations. Each cooperating 



Number 

Classification Category Identification and Definition of Programs 



A. Cross section and resonance-integral calculations 

B. Spectrum calculations, generation of group constants, 



29 



lattice and cell problems 

C. Static design studies 

D. Depletion, fuel management, cost analysis, and reactor 



44 

43 



economics 

E. Space-independent kinetics 

F. Space- time kinetics, coupled neutronics-hydrodynamics- 



36 



11 



thermodynamics and excursion simulations 

G. Radiological safety, hazard and accident analysis 

H. Steady-state and transient heat transfer 

I. Deformation and stress distribution computations, struc- 



14 

19 

21 



tural analysis and engineering design studies 

J. Gamma heating and shield design problems 

K. Total systems analysis 

L. Data preparation 

M. Data management 

N. Subsidiary calculations 

O. Experimental data processing 

P. General mathematical and computing system routines 

Q. Radiation effects 
Z. Nuclear data 



Total 



27 

13 

1 

7 

7 

5 

13 

12 

2 

2 

306 



Table 1. Argonne Cod* Center Library by Problem Classlflcetlon, 
ANL-7411 Supplement 5 
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Installation designated a representative to serve as liaison with the Center. These i 

representatives relay Information to the Center concerning programs or requests orlg- j 

Inatlng within their Installation and apprise colleagues of the Center's program j 

collection and operating procedures. \ 

] 

Information services provided by the Center Include the preparation, editing, j 

and publication of the program abstracts, the answering of technical Inquiries on the j 

program collection — Its use and computer Implementation, the compilation of computer 
program bibliographies, and the reproduction and dissemination of the program pack* 
ages. Over 5000 program packages, or portions thereof, have been distributed since 
the Center was organised. A histogram of the yearly distribution statistics through j 

fiscal 1970 appears as Figure 1. j 




Through 6/30/65 1966 1967 I960 1969 1970 

Fiscal Year 



Figure 1. Humber of Program Packages Distributed, 1962*70 

Computer activities encompass the review and assimilation Into the library of 
the computer-media portion of the program package, the testing, evaluation, and main- 
tenance of the program collection, and automation of the Center's operation through 
the design and Implementation of ACCESS. The collection Is composed of program pack- 
ages offered the Canter by authors or Installation representatives, programs sub- 
mitted In compliance with USAEC-contract provisions, those obtained from the ENEA 
Computer Prograane Library through International agreement, and those solicited by 
the Center to mast outstanding or anticipated requests. The volume of program 
acquisition activity Is Indicated In Table 2. < 
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ENEA Computer 



Fiscal 

Year 


Argonne Code 
Center Programs 


Programme 
Library Programs 


Total 


1962 


41 




41 


1963 


40 




40 


1964 


48 




48 


1965 


39 


14 


53 


1966 


87 


16 


103 


1967 


52 


43 


95 


1968 


87 


34 


121 


1969 


65 


34 


99 


1970 


105 


10 


115 




564 


151 


715 


Table 2. 


Number of Program Packages 


Contributed to the Center, 


1962-1970 



Cooperative efforts cover activities of joint Interest undertaken with the par- 
ticipating Installation representatives, the Mathematics and Computation Division of 
the ANS, or the ENEA Library. In the past, these efforts have Included publication 
of a cooperating Installation computer facilities handbook* and a benchmark problem 
report, ^ preparation of documentation guidelines^ and recommended programming prac- 
tices for computer program exchange, and collaboration on program testing and stan- 
dardization of library Interchange procedures. These, and the other activities of 
the Argonne Code Center, were described In a paper presented last year at the ^ 
International Atomic Energy Agency's Symposium on Handling of Nuclear Information. 

II. ACCESS - An Overview of the System 

In mid-1967, when Argonne National Laboratory put into operation an IBM 360/50-75 
system with large on-line storage capacity and terminal editing capabilities, plans 
were made to automate the assimilation and distribution of the computer-media portion 
of the program packages and the maintenance of the extensive magnetic tape library. 
Preliminary specifications for ACCESS were laid out In November of 1967, Incorporating 
statistical and record-keeping functions of the Center along with the library automa- 
tion. The system as designed provides storage, retrieval, modification, and display 
facilities for Information utilized In the operation of the Center and the performance 
of Center services. Due to funding limitations Implementation of the system was 
delayed almost two years; programming started In 1969 and progress to date Is reported 
In this paper. The Laboratory computer configuration on which the system Is being 
built Is shown In Figure 2. 

Any computerized Information storage and retrieval system consists basically of 
two components: the stored information, or data base, and the computer programs 
written to create, update, manipulate, and retrieve this Information. ACCESS contains 
six Interrelated but dissimilar data bases: 

1. requests to the Center for computer programs; 

2. statistics maintained by the Center on program package transmittal; 

3. a directory, or table of contents, of the Center's program package library; 

4. descriptive abstracts of the library collection; 
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5. library program packages including such items as the source program, 

object or "run" program, input data for a sample problem execution, 
auxiliary library data and routines; 

6. information on cooperating installations or recipients of library programs, 

such as hardware facilities, mailing address, library program 
holdings, and selective dissemination of information (SDI) profile. 

Four of these are split into two segments — a "current" segment and a "history" 
segment to increase the efficiency of the system and to provide an archival facility 
at reduced cost using off-line rather than on-line devices for information storage. 

The system is programmed as a collection of modular programs, each written to 
perform a specific task utilizing one or more of the data bases. A single program 
or sequence of programs, or tasks, may be executed at run time. Three types of 
modules exist in ACCESS. These are identified as the file construction and mainte- 
nance modules, the file editing or report-generating moudles, and the special func- 
tion or task modules. This last type includes many program modules written to 
automate existing manual or semi-automatic Center procedures, such as the preparation 
of the semiannual supplements to the compilation of program abstracts published by 
the Center. ^ A standard ACCESS vocabulary consisting of file-maintenance and report- 
generating verbs and data-identif ier nouns has been defined and is used throughout 
the system. The commands ADD, MODIFY, DELETE, and PRINT direct the updating and 
checking of the data bases while the verbs SELECT, ORDER, and EDIT provide the 
retrieval and editing capabilities. FINIS is used universally to terminate a command 
string. Data identifiers such as NUM for library accession number, NAM for the pro- 
gram or installation title, CAT for the problem classification or distribution 
category, ITM for the file entry’s item identification, and KEY for SDI keyword or 
Boolean keyword-combination are invoked with these verbs to update data bases and to 
prepare publications, reports, and transmittal labels. Few commands are required with 
the function modules, where programmed conventions and default options are applied; 
however, the two general terms OVERRIDE and SPECIAL have been adopted to overide 
programmed conventions when required and to permit substitution of input data for file 
information on a temporary basis. 

A schematic diagram of ACCESS is presented in Figure 3 and a description of the 
system data bases is given in the following section. 

III. The ACCESS Data Bases 

As noted earlier, £he six data bases while interrelated are quite dissimilar in 
structure and size. Before proceeding with a description of each of the six individ- 
ual data bases, a look at the over-all system data base characteristics is in order. 

By far, the largest data base in terms of storage requirements is the program package 
collection; it Is also one of those most frequently accessed. Four of the six data 
bases have both current and history segments. In each case, the history segment has 
been designed for retention on magnetic tape. The two remaining, the request and the 
statistics files, do not contain archival data. Accompanying each data base is a 
directory containing the pointers and other control information used by the program 
modules to access selected entries, or particular items within file entries, to ex- 
pedite the Information retrieval function. The program modules utilize the structure 
definition provided by the PL/1 programming language to accomplish this. Table 3 
summarises the estimated maximum data base storage requirements for the system and 
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for each of the Individual files, and indicates the size of each established file at 
this time* To date, no effort has been made to create either the request or statis- 
tics data base; this will be undertaken when the package file is established. 



STORAGE REQUIREMENTS 



Identi- 




Est. 


Maximum 


Implementation 




fication 


Contents 


Entries 


Bytes 


Entries 


Bytes 


DEVICE 


REQ 


Unanswered and in- 
complete requests 


2,000 


724K 


- 


- 


IBM 2314 


STA 


Transmittal 

statistics 


4,000 


896K 


- 


- 


IBM 2314 


TOC* 


Directory of pro- 
gram package 
collection 


2,000 


240K 


432 


52K 


IBM 2314 


ABS* 


Program abstracts 


1,000 


7.200K 


306 


2203K 


IBM 2314 


PKG* 


Program packages 


2,000 


300.000K 


6 


720K 


IBM 2314 
IBM 2321 


CIR* 


Coop installations 
Other recipients 
Library holdings 


500 

680 

18,000 


121K 

128K 

144K 


105 

678 

5,000 


30K 

127K 

65K 


IBM 2314 



^History segments of these files reside on magnetic tape. 



Table 3: ACCESS Data Base Storage Requirements 

The request (REQ) data base was designed to replace a manual "pending" file 
mintalned in the Center office* The Center receives, in addition to the routine 
requests for copies of material in the library collection, queries on the availabil- 
ity of programs known only by title or by citation, and requests for information on 
nonlibrary programs* In such cases the staff attempts to locate the appropriate 
program and to obtain the information, or the program, desired* The ACCESS request 
file is set up to accommodate as many as 2000 pending program requests and queries 
at any time* It is a fixed-format file, structured to permit as many as 11 informa- 
tion items for each entry* These information items are: 

1* request number, 

2. date of entry, 

3* cooperating or requesting installation identification, ••• 

4* program package identification, 

5* package contents symbols, 

6* override option codes, * 

7* special requester's name, 

8* address option codes, 

9* special address information, 

10* special transmittal instructions, and 

11* Center comment or miscellaneous information* 

Both items 1 and 2 are assigned automatically at the time the request is entered into 
the file* 
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